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ABSTRACT 


These proceedings report the results of a workshop on identification and control of 
flexible space structures held in San Diego. CA, July 4-6, 1984. The workshop was 
co-sponsored by the Jet Propulsion Laboratory and the NASA Langley Research Center, 
and preceded the 1984 American Control Conference held at the same location. The 
main objectives of the workshop were to provide a forum to exchange ideas in exploring 
the most advanced modeling, estimation, identification and control methodologies to 
flexible space structures. The workshop responded to the rapidly growing interest 
within NASA in large space systems (space station, platforms, antennas, flight 
experiments) currently under design The workshop consisted of surveys, tutorials, 
contributed papers, and discussion sessions in the following general areas: missions of 
current interest - space platforms, antennas, and flight experiments; control/structure 
interactions - modeling, integrated design and optimization, control and stabilization, 
and shape control; uncertain ly mana gement - parameter identification, model error 
estimation/compensation, and adaptive control; and **peritn«» ntal evaluation - ground 
laboratory demonstrations and flight experiment designs. Papers and lectures on these 
topics were presented at a total of fourteen sessions, including three panel discussions. 


iii 
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A STUDY ON THE CONTROL OF 
THIRD GENERATION SPACECRAFT* 

E. J. Eavfeon and W. Going 

University of Toronto 
Toronto, Ontario, Canada MSS 1A4 


ABSTRACT 

An overview of some studies which have recently been carried out in [l]-[3] 
on the control of third -generation spacecraft, as modelled by the M^AT space 
vehicle configuration, is made. This spacecraft is highly non- symmetrical and 
lias appendages which cannot in general be assumed to be rigid. In particular, it 
is desired to design a controller for MSAT which stabilizes the system and satis- 
fies certain attitude control, shape control, and possibly station-keeping re- 
quirements; in addition, it is desired that the resultant controller should be 
robust and avoid any undesirable "spill-over effects". In addition, the control- 
ler obtained should have minimum complexity. 

The method of solution adopted to solve this class of problems is to formu- 
late the problem as a robust servomechanism problem [5] -[7], and thence to obtain 
existence conditions and a controller characterization to solve the problem. 

The final controller obtained for MSAT has a distributed control configura- 
tion and appears to be quite satisfactory. 

INTRODUCTION 

This paper summarizes studies carried out in [l]-[3] on control system 
structures known as third-generation spacecraft. Such spacecraft have: 

(1) Large mass 

(2) High power 

(5) Large non- symmetric flexible appendages 

(4) Precise communication RF bean control requirements. 

In particular, the class of spacecraft represented by the Mobile Communications 
Satellite (MSAT) is used as a reference for these studies. This spacecraft has 
non- symmetric appendages which cannot be assumed to be rigid (see Figure 1) . 

There are a number of control problems associated with the attitude-control, 
shape-control and possibly station-keeping control for such third generation 
spacecraft (referred to as LFSS) , which may be listed as follows: 

A . Ti.e LFS S Control Problem 

Problem 1: Lightly Lamped, Oscillatory Plan 1 ' 

A LFSS has eigenvalues either at the origin or approximately distributed a- 
long the imaginary axis. One of the basic objectives that a controller must ac- 
complish in this case is to stabilize the rigid body modes of the LFSS, and at 
the same time to stabilize the elastic modes of the LFSS. This is called the 
LFSS stabilization problem. 

•This work was supported by the Department of Communication? , Ottawa, Canada under 
contracts DOC-CR-SP-82-007, D0C-CR-SP-S3-002, DOC-CR-SP-84-002. 
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Problem 2: Modelling 

In modelling a LFSS, experience has shown that dynamic analysis may provide a 
framework for the model 1 ing of the low frequency elastic modes of the LFSS in a 
reasonably accurate way, but that the high frequency elastic modes cannot be ex- 
pected to be determined accurately, i.e. there will always be errors present in 
modelling the high frequency elastic modes of the LFSS. In addition, the calcul- 
ation of dampening effects on the LFSS can only be done with great uncertainty. 

Problem 3: The Infinite Dimensional Plant - The "Spill-Over Problem" 

The classical modelling of elastic structures as continua results in the well 
known "infinite dimensional" system reprc station of a LFSS. Whether or not one 
adopts this infinite dimensionality representation seriously from an engineering 
standpoint, there is no question that the number of system elastic modes present 
in a LFSS is always larger than the number which any design model of a LFSS can 
accommodate. In trying to control the modelled rigid and elastic modes, it is 
essential that the controller should not cause. these unmodelled high frequency 
elastic modes to become unstable. This is called the "Spill-Over Problem". 


Problem 4: The Sensor/Actuator Placement Problem 

The LFSS is intrinsically distributed, and the configuration of control hard- 
ware is not in general specified. Thus, unlike many conventional control problems, 
part of the LFSS control problem is in determining the number and location of 
sensor/actuators on the LFSS. 

Problem 5: Requirement for Multivariable Control Theory 

The concept of "third generation" spacecraft, unlike the first and some 
second generation spacecraft, precludes single-input, single-output control design. 
Some type of multivariable control design method is mandatory to deal with the 
severe interaction occurring in the system. 

Problem 6: Minimization of Number of Sensors/ Actuators 

Th. j is a practical limiation on the quantity of hardware that can be dis- 
tributed over the LFSS vehicle. This implies in particular that one cannot assume 
full state feedback is available, and that the number of actuators/ sensors used 
must be limited, i.e. one must minimize any unnecessary sensor/actuators required 
for LFSS control. 

The following problem definition is now given: 

B. The LFSS Robust Servomechanism Problem 

Assume that a LFSS can be exactly described by the following finite dimen- 
sional linear time invariant model: 

x = Ax + Bu + Eio 

y = Cx + Fto (1) 
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n <4 

where xeR is the state, ueR is the control (actuator inputs), y eR m are the 

r ® o 

measured (sensor) outputs, and yeR are the outputs to be regulated. Here u)eR 

are assumed to be constant unmeasurable disturbances applied to the structure, 
w * 

<^ m eR m are assumed to be constant unknown measurement errors and e = y-y ^ is the 

error in the system where y re £ is a constant set-point. Thus, it is assumed that 

(1) may include an arbitrarily large number of elastic modes (but not infinite). 

Assume now that an approximate model of (1), called the design model for (1), 
is given by: 


x = Ax + Bu + Eio 


y = Cx + Foj 


( 2 ) 


where xeR is the state of the design model, and where n«n. It is desired now 
to find a controller based on the design model (2), such that when it is applied 
to (1), the system is asymptotically stable, i.e. no spill-over occurs, and such 
that : 


lim e(t) = 0 , Vx(0)eR n , VtoeR , Vw eR m (3) 

m 

This is called the LFSS Robust Servomechanism Problem, which includes the follow- 
ing subproblems: 

(1) Stabilization 

(2) Station-keeping 

(3) Attitude control 

(4) Shape control. 


THE MSAT CONTROL PROBLEM 

The MSAT spacecraft is illustrated in Figure 1. It consists of four compon- 
ents, one of wich is rigid (the bus) and three of which are flexible (the solar 
array, the tower, and the reflector). The tower-reflector-hub hinge point is 
assumed to have a gimbal (see Figure 2) . 

The coordinates assumed for each of these substructures are as follows: 

(1) Bus - three rigid rotations ( 0 x » 0 y» 0 z )' 

(2) Tower - relative displacement of tower tip to tower root 

- relative angular displacement of reflector with respect to 
frame fixed at tower root (with zero gimbal angles) 

( a l ,a 2 ,C V ' 

(3) Reflector - two gimbal angles at tower-reflector-hub hinge point (8^,8.,)' 

The actuators which are assumed to be available are as follows: 

(1) Eight thrusters f^, i=l,2,...,8, four from thrusters on the bus and four 

from thrusters at the reflector hinge point, aligned as shown in Figure 2. 
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( 2 ) 


Two torquers at the reflector hub, one about each gimbal axis 
Figure 2) . 




(see 


In this case, a design model and an evaluation model was developed in [4], 
in which the design model has 18 states consisting of S rigid body modes (corre- 
sponding to the three rigid rotations of the bus and two gimbal angles of the 
reflector) together with 4 elastic modes, and the evaluation model has 32 states 
consisting of 5 rigid body modes and 11 elastic modes. Table 1 gives the eigen- 
values of the open loop system for the two models. The models used in this study 
included the effect of dampening terms D, Dg (see Table 1) . 


TABLE 1 : Open Loop Eigenvalues of MSAT Vehicle 



Standard Design Model 

Evaluation Model 


With Damping 

With Damping 

With Damping 

With Damping 


Term D 

Term D 

Term Dg 

Term Dg 


Excluded 

Included 

Excluded 

Included 

Rigid 

Body 

Modes 

0 (repeated 

0 (repeated 

0 (repeated 

0 (repeated 

10 times) 

10 times) 

10 times) 

10 times) 


0±j0. 124 

-0.000923±j0. 124 

0±j0.124 

-0.000923±j 0.124 


0±j0. 239 

-0.00170 ±j0.240 

0±j0.151 

-0. 000 853±j 0.151 

Elastic 

0±j 0.556 

-0.00856 ±j0.556 

0±j0.239 

-0.00171 ±j0.239 

0+j 0.780 

-0.0211 ±j 0.779 

0±j 0.556 

-0.00856 ±j 0.556 

Body 



0±j0.690 

-0.00553 ±j0.690 

Modes 



0±j0.780 

-0.0211 ±j 0.780 



0±j 1.55 

-0.0751 ±jl.55 




0±j3. 14 

-0.0280 ±j3.14 




0±j3.96 

-0.0528 ±j3.96 




0±j9 .95 

-0.524 ±j 10. 1 




0±jl4.0 

-1.17 ±jl3.8 


It may be noted that the elastic modes of the evaluation model interweave with 
the elastic modes of the design model. 

A. Description of Problem to be Solved 

Ii, this case it is desired to solve the LFSS Robust Servomechanism Problem 
for the MSAT vehicle. In particular, there are two separate requirements for the 
controller to be designed for the MSAT vehicle: 

Requirement I 

Find a controller, based on the MSAT design model, which solves the following 
problems: 

• Stability: stabilize the 5 rigid body modes and the 4 elastic modes of the 

system. re f 

• Attitude control: regulate 0,9,0 to desired constant set points 0 , 

ref ref X y Z a 

0 , 0 respectively, in the presence of unknown constant disturb- 

y z 

ances. 
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• Shape control: regulate ®2 +a 2* ^ ^ 1 ’ ^ ^^2’ ^"^ 3 * a 3 to zer0 » zn 

the presence of unknown constant disturbances. 

• Spill-over problem: it is desired that the controller should satisfy the 

above requirements, and not cause any instability to occur with respect 
to any of the vehicle's elastic modes which are not included in the 
design model. 

• Controller complexity: it is desired to minimize the number of sensors and 

actuators which are required to solve the problem. 

• Discrete controller implementation: it is desired that the controller, when 

implemented digitally, should not require an excessively large sampling 
rate to maintain stability. 


Requirement II 

Apply the controller obtained, based on the MSAT design model, to the MSAT 
evaluation model, and verify that all objectives above are satisfied. 

The outputs to be regulated in this case are given by: 


y = ( 0 x *e ,e z ,6 1 +a 1 ,$ 2 +a 2 ,f" 1 6 1 ,f" 1 6 2 ,f" 1 6 3 ,a 3 )' 


(4) 


B. Assumptions Hade in Problem Formulation 

In this problem, it is assumed that there is no requirement for controlling 
the w x> uiy, rigid body modes. (Note: this assumption is not essential, e.g. 

12], 13] also deals with the case of station-keeping.) It is also assumed that 
there is no need to include any gyroscopic terms in the design and evaluation 
models . 


METHOD OF SOLUTION ADOPTED TO OBTAIN 
A CONTROLLER TO SOLVE PROBLEM 


The method of approach adopted to solve this problem was based on using the 
results of the "robust servomechanism problem" [5] -[7], in conjunction with a 
parameter optimization method [ 8 ] to determine the controller's parameters, e.v 
see [9] which solves a special case of the above problem when the sensors and 
actuators are collocated, using a decentralized control configuration. In this 
case, existence conditions for a solution to the problem were obtained, and a 
necessary controller structure developed. In particular, it was found that any 
controller which solves the MSAT problem specifications must consist of a "servo- 
cornensator" [5] (unique), together with a stabilizing compensator (non -unique ) . 
In this study, the simplest possible stabilizing compensator, i.e. a stabilizing 
compensator consisting of only proportional and rate feedback terms, was used. 

In this case, in order to satisfy the existence conditions obtained for a 
solution to exist to the problem, it was necessary to choose the following inputs 
(actuators) and measurable outputs (sensors) for the controller: 


Outputs (sensors) : 


y m = (e x ,6 y ,e z ,6 l’ e 2 ,0t r a 2 ,f 6 l* f V 


(S) 


5 



( 6 ) 


Inputs (actuators) : 

u * (8 c.' 8 e i - 8 6 2 - f i' f 2' f 5- f 6 ) ' 

* * * * * 

where g^ , fj, f 7 , f,., correspond to various combinations of the thrusters 
f j ,t' 2 , . .? ,fg (see Figure 2), as described in Appendix I. 


FINAL CONTROLLER CONFIGURATION OBTAINED 


In this case, »„e following distributed controller was obtained as a solu- 
tion to the MSAT robust servomechanism problem, based on the MSAT design model: 


g* 
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where s denotes the Laplace Transform operator, where 
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and where Kj, K 2> Kj, K 4> K 5 , y are given as follows: 


1.43 

0.500 

24.7 

1.34 

-0.0460 

0.0255 

4.64 

1.12 

15.6 

-0.000439 

-6.81 

-0.000957 

0.00981 

-0.000483 

18.6 

0.00326 

38.0 

-0.231 

14.5 

-0.00955 

59.0 

-0.00916 

0.0216 

0.0127 

-2.40 
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28.5 

0.510 

-136 

0.0653 

1180 

" 7. 14*10"' 
1.28*10*' 
-3.41*10*' 
1.63*10"* 
2.95*10"' 

-0.464 

-0.0144 

-0.00438 

0.136 

-0.000753 

0.0268 

0.0273 


10.0 

92.8 

-0.0191 

760 

-0.183 

2.50*10 

2.32*10 

-4.79*10 

1.90*10 

-4.58*10 

0.0433 

0.0536 

0.201 

0.0461 

1.13 

0.225 

-0.226 


0.0268 0.225 

0.0273 -0.226 


494 26.7 

22.3 312 

0.196 -0.00965 

-4.63 290 

0.432 0.254 


-0.920 

-0.00877 

372.2 

-0.191 

-48.1 


-4 

1 . 24*10~ 2 

6.68*10* 4 

-2.30*10* 5- 

-3 

5.58*10* 4 

7.80*10* 3 

-2 . 19*10* 7 

-7 

4.90*10" 6 

-2.41*10‘ 7 

9 . 31*10* 3 

-2 

-1.16*10* 4 

7.26*10" 3 

-4.78*10* 6 

-6 

1 .08*10" 5 

6.34*10~ 6 

-1 ,20*10” 3 _ 


Y = 2.0*10 


This controller is just a multivariable generalization of the classical 
three term controller used in classical control. The controller has minimal com- 
plexity ; . the sense that it has minimum order feedback dynamics and has the 
minimum number of actuators/sensors required in order to solve the problem. It 
is to be noted that no a priori assumption the distributed structure of (7) 
was made — the distributed structure of the controller (7) arose from the 
analysis automatically. 


PROPERTIES OF PROPOSED CONTROLLER 

The mam features of the proposed controller when applied to the MSAT design 
node! a evaluation model will now be described. The main features of interest 

are : 

(1) The stabilization properties of the proposed controller. 

(2) The steady state regulation px-operties of the proposed controller. 

The following results are obtained: 

A . Eigenvalues of Closed Loop System Using Proposed Controller 

Table 2 gives a listing of all eigenvalues obtained by applying the proposed 
controller (7) to the MSAT design model and evaluation models. 
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TABLE 2 : Listing of Closed Loop Eigenvalues Using Proposed Controller (7) When 

Applied to MSAT Design and Evaluation Models ' 


Standard Design Model 

Evaluation Model 

- 0 . 00047t j 0 . 0085 t 

-0.0024±j0.016 . * . . 

-0. 0051 ±j 0.022 body 

-0.0097±j 0.030 | es 

-0. P10± jO. 031 i 

-0. 0004 7tj 0.0085 t 

-0.0024± j0.016 . .* . 

-O.OOSlij 0.023 ri «^ body 

-0.0097±j 0.030 | es 

-O.OlOt j0.031 J 

-0.00014±j0. 124 1 

-0. 006 l±j 0.240 elastic 

-0.017tj0.557 

-0.029ij0.7S0 

: body 
les 

-0.00014±j0.124 

-0.00020±j0.151 

-0. 00611 j 0.240 

-0.017tj0.557 

-0. 00791 j 0.690 

-0. 00291 j 0.780 elaStl< 

-0. 1291 j 1.35 ■°‘ 

-0. 0671 j 3. 16 

-0. 0691 j 3. 95 

-S.51j8.88 

-O.Slljll .3 

body 

les 

-5.0*10"“ t 

-5.0*10"“ 

-5.0*10-“ 1 

c nxln -4 servo-compensator 

foxier* “°f s 

-5.0*10"“ 

-5.0*10"“ + 

-1.7*10" 3 t 

-5.0*10"“ 

-5.0*10"“ 1 

-5 0*10"“ servo-compensator 

-5.0*10““ “°f es 

-5.0*10'“ 

-5.0*10"“ ♦ 

-2.0*10“ 3 t 

-?'S* 1 lS" 3 feedforward 

-2’ 0*10" 3 controller 

-2 .0*10" 3 ® 0< ? es 

-2.0*10" 3 4- 

-2.0*10" 3 t 

-2.0*10~ 3 e * . 
2 0*10" 3 feedforward 

-2]o*10" 3 controller 

-2.0*10“ 3 “°f es 

-2.0*10" 3 4 


It is observed that the resultant closed loop system is asymptotically stable 

for both the design and evaluation models, i.e. no undesirable spill-over effects 

occur. It is also observed that the dominant time constant of the system is 

mainly associated with the servo-compensator modes. This implies that one would 

expect for the case of tracking, that the dominant time response of the system 

would be associated with the feedforward controller modes, i.e. TC, 4500 sec 

dora 

4 8 min., and for the case of disturbance rejection, that the dominant time of 

the system would be associated with the servo-compensator modes, i.e. TC . 4 

aom 

2000 sec 4 0.6 hrs. This result is verified in the simulation studies to follow. 
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B. Steady-State Values of Outputs Using Proposed Controller : Tracking Case 

Table 5 gives a summary of results obtained for the case of unit step func- 
tion tracking, when the proposed controller (7) is applied to the MSAT design and 
evaluation model. It is observed that all 9 outputs of the system are asympto- 
tically regulated to their correct values as desired. 

TABLE 3 : Steady-State Values of Outputs Using Proposed Controller (7) When 

Applied to Design and Evaluation Model - Tracking Case 



e ref =i 
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.y 

0 ref =l 

z 

0 
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1 

0 

0 

0 

y 

0 

1 

0 

0 

z 

0 

0 . 

1 

V“i 

0 

0 

0 

e 2 *a 

0 

0 

0 


0 

0 

0 


0 

0 

0 


0 

0 

0 

a 3 

0 

0 

0 


Note: Any | number | <10 is assumed to be zero. 


C. Steady-State Values of Outputs Using Proposed Controller: Disturbance 

Rejection Case 


Tables 4 and 5 give a summary of all results obtained for the case of dis- 
turbance rejection, when the proposed controller is applied to the MSAT design 
and evaluation models respectively. In this case, it is assumed that a unit step 
function change occurs for different disturbances corresponding to g c ,g c ,..., 


fg.fg defined in Table 6. It is observed that the first 7 outputs of the system 
are asymptotically regulated to zero, and that the remaining two outputs are 
approximately equal to zero in all cases, as is desired. . 


D. Sampling Rate Requirements for Digital Implementation of Proposed Controller 

If it is assumed that the proposed controller (7) is to be implemented 
digitally, then it is necessary that the sensor outputs and actuator signals be 
updated at a fast enough rate so as to guarantee closed loop stability, when the 
the controller is applied to the evaluation model. In this case, on assuming 
that the sensor and actuator signals are updated at the same rate, it was found 
that a sampling rate of at least 0.1 Hz must be used to implement the proposed 
controller. This requirement is not demanding. 
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Note: Any | number |< 10 ^ is assumed to be zero. 



Note: Any |number|<10 ^ is assumed to be zero. 
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SIMULATIONS OBTAINED USING PROPOSED CONTROLLER 
TO SOLVE MSAT PROBLEM 


This section gives some typical simulations of the closed loop system ob- 
tained by using the proposed controller (7) applied to the MSAT design and evalu- 
ation models. Additional simulation studies are given in [3]. 

TAf 

A. Example No. 1 (Attitude Control: 0^ =1) 

In this example, it is assumed that the system has zero initial conditions, 
that there are no disturbances present, and that a unit step function change of 

+1 occurs in the set point for 0 at t=0, i.e. 0 re ^=l, 0 re ^=O, 0 re ^=G. 

x x y z 

Figure 3 gives a plot of all 9 output variables y given by (4) when the con- 
troller is applied to both the design and evaluation model in this case. It is 
observed that the system’s response is almost decoupled, i.e. the output © x is 
approximately equal to its desired value of +l.at t i 50 min, and that all other S 
outputs are barely excited. 

Figure 4 gives a plot of the 7 control variables u given by (5) for this 
example. 

B. Example No. 2 (Disturbance Rejection: fj.=l) 

In this example, it is assumed that the system has zero initial conditions, 
that all set points are identically equal to zero, and that a unit step function 
change of +1 occurs at t=0 corresponding to a disturbance thrust fj=l, where f5 

is defined in Table 6. This example would correspond to a misaligned thruster 

associated with the proposed controller. 

Figure 5 gives a plot of all 9 output variables y when the controller is 
applied to both the design and evaluation model in this case. It is observed 
that the elastic modes of the vehicle are now excited, and that the output vari- 
ables are asymptotically regulated to zero in approximately 2.7 hours, which is 
consistent with the closed loop eigenvalues of the system given in Table 2. 

Figure 6 gives a plot of the 7 control variables u for this example. 

C. Example No. 3 (Disturbance Rejection: J g =l) 

This example is similar to Example No. 2 except that it is assumed that a 
unit step function of +1 occurs at t=0 corresponding to a disturbance thrust f g =l, 
where f„ is defined in Table 6. This disturbance is representative of an arbi- 
trary constant disturbance which may affect the system. 

Figure 7 gives a plot of all 9 output variables y when the controller is 
applied to both the design and evaluation models in this case. It is observed 
that the elastic modes of the vehicle are now also excited as they were in 
Example No. 2, and that the output variables are satisfactorily asymptotically 
regulated with the same time constant as in Example No. 2. 

Figure 8 gives a plot of the 7 control variables u for this example. 
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TABLE 6: Definition of Disturbances Assumed 



— 
Disturbance forces corresponding to the thrusters 
f r f 2’ f 5 ,f 6 ,f 0’ f 9 res P ectivel y of figure 2 


Disturbance torques corresponding to gg ,gg 
respectively about the gimbal axis 3^,3^ 2 

i c >g c >§ c 

n z 3 

Disturbance torques in the bus about the x,y,z 
axis respectively 


ROBUST PROPERTIES OF CONTROLLER DESIGN METHOD 

A study of the robustness properties of the proposed controller design method 
was carried out [3] . This was done by comparing the controller designs obtained 
using the proposed method to different design models of MSAT. It was concluded 
that the proposed design method appears to be quite insensitive to the type of 
design model used, e.g. all controllers obtained, when based on MSAT design models 
which had at least two dominant elastic body modes included, produced stable 
closed loop systems and give satisfactory tracking/regulation, when applied to 
the MSAT evaluation model. Other studies showed that the controller is robust 
with respect to evaluation models of arbitrary complexity. 

CONCLUSIONS 

This paper gives a brief summary of the work performed in [l]-[3]. In these 
studies, the control system design of a third-generation spacecraft, as modelled 
by the MSAT space configuration is studied. This spacecraft is highly non- 
symmetrical and has appendages which cannot, in gen3ral, be assumed to be rigid; 
the elasticity of these appendages makes the control system design particularly 
demanding. In particular, it is desired to design a controller for MSAT which 
stabilizes the system and satisfies certain attitude control , shape control ard 
possibly station-keeping requirements. In addition, it is desired that the resul- 
tant controller should be robust and avoid any "spill-over effects", i.e. it 
should satisfy the problems' specifications based on only an approximate design 
model for MSAT being available. In addition, the controller obtained should have 
minimum com pl exity , i.e. a minimum number of sensors/actuators should be used. 

The method of solution adopted to solve this class of problems was to formu- 
late the problem as a robust servomechanism problem and thence to obtain existence 
conditions and a controller characterization to solve the problem. In this case, 
the controller obtained must contain a servo-compensator together with a stabiliz- 
ing compensator. 

The final controller obtained for MSAT has a distributed control configura- 
tion, and appears to be quite satisfactory, i.e. extensive testing of the con- 
troller shows that the controller is indeed robust with respect to the choice of 
the design model, and that it satisfies all specifications of the problem state- 
ment. 
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APPENDIX I 


Definitions of g* ,fj,f*,f*,f* 

* 

g is defined in terms of thrusters f . , f_, f_, f. as follows: 
C 7 1 c 3 4 




* 



* 

f_ are definea in terms of thrusters f , , f„, f_, f. as follows: 
2 1 2 3 4 
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Figure 2: Assumed control inputs for MSAT spacecraft (taken from [4]), 
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ATTRACT 

This paper considers the problem of designing a !?r»eor controlle '-r sys- 
tem's subject to inequality variance constraints. A quadratic penalty function 
approach is used to yield a linear controller. Both the weights in the quadratic 
penalty function and the locations of sensors and actuators are selected by 
successive approximations to obtain an optimal design which satisfies the input/ 
output variance constraints. The method is applied to NASA’s 64 meter Hoop- 
Column Space Antenna for satellite communications. In addition the solution for 
the control law, the main feature of these results is the systematic determin- 
ation of actuator design requirements which allow the given input/output perfor- 
mance constraints to be satisfied. 


I. INTRODUCTION 

Consider the task of controlling the linear, stochastic syste. 1 

(la) x = Ax + B(u-tw) , xeR 0 , ueR^, yeR* 1 

(lb) y = Cx 


(lc) 


z = Px + v, zeR 


W) 


'x(O) 


E I w(t) 1 = 0, El w(t) I (x T (o), w T (t), v T (t)) = 


,v(t) 


v v (*), 


*0 0 0 

Wfi(t-x) 

0 0 Vfi(t-x). 


W = diag [... ...], V = diag [... 

such that these four control design goals are net: 

(I) E^U) = lir E y. 2 (t) < of, i = 1 k 

t-H» 

Eji 2 (t) = lin E u^ 2 (t) < y.. 2 , i = 1, .... n 
t-H» 
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(II) Only £ < £ sensors are used 


( 2 ) 



= Mx + v 

from the admissible set of £ sensors described from (lc) 
1 


i \ / "i Tx + v i 


(lc) 


z = 



* Mx + v. 


Z £ / \*l X + Y 


(3) 


(4) 


(III) Only m < m actuators are used 

m 

B(u + w) = l b.(u. + w.) 

i=l 1 1 1 

/rom tfee admissible set of n actuators described from (la) 
n 


B(u + w) = l b.(u. + w.) 

i=l 1 1 1 


(IV) The control u(t) is a linear function of the present and 
past measurements z(t), t t. 

Many engineering control design problems can be stated with performance 
constraints of the form (I). For example, large space telescopes are feasible 

2 1/2 

only if the R!!S pointing errors (E v. ) are within certain bounds 
2 1/2 ' 

{Ej/r") ' < a.) so as to achieve diffraction-limited performance (o^) of the 

optics. The designer may also have the freedom to choose from a number of 
different types of sensors and actuators at a number of different locations. The 
locations and the types of actuators (sensors) determine the vectors b. (m. ) in 
(4) and (lc). 1 1 

A straight-forward approach to accommodate the bounded input/output problem 
(I) yields nonlinear controllers [1-2], violating goal (IV). A straight-forward 
approach to accommodate goals (IV) and (I) is to use a penalty function method 
[3-5], minimizing 
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(5) 


t 

t m \ f (llyllq 2+ llu|| R 2 ) dx , ||y|| Q 2 = y T Qy 

*o 


while adjusting Q and R until (I) is satisfied. These successive approximation 
schemes [3-5] presume a fixed measurement/control structure, and hence do not 
satisfy goals (II) and (III). It is Important to unify the treatment of all 
four goals (I-IV) since it has been shown [6-7] they are inherently interdepen- 
dent problems. In particular, for the isolated problems; [6] has shown the 
optimal sensor and actuator selection for LQG problems (5) with fixed (Q,R), and 
[3-5] have adjusted Q and R to satisfy the constrained-variance problem (I) with 
fixed sensors and actuators (i.e. fixed B, H). 

Unfortunately, the optimal answer for the simultaneous solution of both 
problems turns out not to be the juxtaposition of results [6] and [3-5], due to 
the interdependence of the two problems. 

The purpose of this paper is to present a unified treatment of the entire 
problem (I-IV), which we call the Constrained Variance Sensor/Actuator Selection 
(CVSAS) problem. Sectionll describes the approach. Section III gives the formu- 
las for sensor and actuator effectiveness to deal with goals (II)' and (III). 
Section IV presents the numerical algorithm for Iteratively dealing with goal (I). 
Section V gives the algorithm for solving the entire problem (I-IV). Section VI 
illustrates the application to the Hoop-Column Antenna. 


II. APPROACH 

The solution of the problem with inequality constraints (I) is generally 
not unique. To be a bit more specific than statement (I) we define two variations 
of the problem. The first is called the "Constrained-Input Variance" option of 
the CVSAS. In this option the input constraints In (I) are binding and the 
output constraints in (I) are : elected. 

CIVSAS: The Cons trained -Input Variance. Sensor/Actuator Selection Problem 

Satisfy (II), (III), and with all input-constraints binding > 

-2 2 

(6) u.j E co u 1 * 1, 1*1, ...» m , 

minimize (recall y^ * c^xj, 

(7a) Vy ' t > 1. 

-2 2 

If however , there is no i for which Vi > 1 then minimize 
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(7b) 



with all input constraints binding (6). 

Definition: The phrase ", minimal achievable output performance" for the CIVSAS 

will man the minimum constraint violation in the sense of the minimum value of 
Vy in (7) with input constraints binding (6). 

The CIVSAS problem is useful when one wishes to determine the best perfor- 
mance achievable for a given power limitation on the input devices (actuators). 
That is, for a given set of y. the CIVSAS finds the minimum achievable output 
performance. 1 

The second variation of the CVSAS problem -is called the Constrained Output 
variance Sensor/ Actuator Selection (COVSAS). ' 

COVSAS: The Constrained-Output Variance, Sensor/ Actuator Selection Problem 
Satisfy (II), (III), and with all output constraints binding 


(8) °rV * * • 

minimize 


i = 1. 


• • • » 


k 


» 


(Sa) 


"u - l V * >• 


(9b) 


If however , there is no 1 for which E^U.. > 1 then minimize 

m p o 

1/ = l E u/ 
u j-j i °° 1 


with all output constraints binding , (8). 

Definition 2: The phrase "minimum achievable input performance " for the COVSAS 

will mean the minimum constraint violation in the sense of (9) , with all output 
constraints binding (8). 

The COVSAS is useful when one wishes to determine the necessary capabilities 
(design requirements) of the actuators In order to achieve the specified output 
performance. That Is, for a given set of o,, the COVSAS finds the minimum 
achievable input performance. 
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III. SENSOR/ ACTUATOR EFFECTIVENESS 


In this Section we temporarily assume that Q and R in (5) are specified 
diagonal matrices Q = diag [... ...], R = diag [... r^ ...], and we wish to 

determine a ranking of the effectiveness of the admissible set of sensors and 
actuators for the LQG problem described by (1) and (5). To help with this task 
a price or "cost" is assigned to each input and output by decomposing the total 
system cost function (5) into contributions from each input and output. This 
task is called "input or output cost analysis" and from [6] we have the results 


( 10 ) 



where l/. u , V? t l/. w , l/. v is the contribution in V of, respectively, the 1^ 
control u.j, output y., noise w.., or noise v.., and 


(11a) 



(lib) V. y « qJIcJj 2 . i = 1, k 

1 1 1 P-OC 

(11c) V i W = w i i 1 1 b i 1 1 k+L 1 = 1’ •••’ m 

(lid) v* = v 11 ||f 1 ||^ 1 - 1 I 

A 

where P, K, X and L satisfy 

(12a) 0 = PA T + AP - PKVV’P + BWB T , [fj, .... f £ ] = F = PtfV 1 

(12b) 0 = KA + A T K - KBR -1 B T K + C T QC. l9 v .... gj a G T = -KBR" 

(12c) 0 = X(A + BG) T + (A + BG)X + FVF T 

(12d) U = L(A - FM) + (A - FM) T L + G T RG 

The effectiveness of the 1 th sensor is measured by 
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(13) 


|/. sens = i/. v 

and the effectiveness of the actuator is measured by 
(14) * V i u - V* 

cpnc 

These terms V. and l/. represent the particular combinations of the input/ 

output costs l/. u , i/- w , l/^ v which are involved in the performance of each sensor 
and actuator. (The distinction here is that the effect of the input can be 
calculated by (/. w , but the effect of an actuator involves both l/^ u and V* since 
the actuator is noisy, and this dependence is accounted for in (14)). To see 

Cane ap+ 

that V. and V. gives the appropriate measure of the effect of deleting 

tn A.U XL X.L. 

the i in sensor or the i w sensor or the l sensor or the i n actuator, refer 
to the numerical work in [7]. 

Two results from [6] add insight into the use of (13), (14). 

Theorem 1, [6,7]: 

For a specified (Q,R)» the optimal value of the LQ6 performance metric (5) 
marmot be reduced by the deletion of any of the admissible sensors z. , 

1 = 1 , .... l. 

Theorem 2, [6,7]: 

For a specified (0,R) the optimal value of the LQG performance metric (5) 
can possibly be reduced by the deletion of some of the admissible actuators 
u.- , i = 1, ...» m. 

cane 

These theorems partially explain why the sensor effectiveness Is a 

much simpler calculation than l/. act . Since the magnitude of the gain on the i** 1 
sensor signal | |f^| | - I l m il Ipp V ii -*• 0 as -*• », an extremely noisy sensor 

simply will not affect the optimal LQG controller. Hence, the effectiveness of 

fU y 

the 1 sensor can be calculated by the Input cost 1/j . Section V will show how 
to use (13) and (14) in the solution of the COVSAS problem. 
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IV. THE COVLQG ALGORITHM 


Now we cite an algorithm (COVLQG) to solve the COVSAS problem under the 
temporary assumption that l = i and m * m. That is, all admissible sensors and 
actuators are used (3 = B and P = M). The COVLQG algorithm will first be stated 
and then its theoretical properties will be discussed. 

The COVLQG algorithm (i.e. the COVSAS with i = m = m) : 

Step A: Compute P from (12a). If o. ^ | | lip > 1 STOP. No solution to 

the COVLQG problem exists. Otherwise initialize 

q^o) = a ." 2 , r^o) = U^ 2 . 


2 

Discussion of Step A: The lower bound on E in an LQG problem is 
E :jf> I l c i! I 2 (from the well known lower bound tr CPC^ on 1/ in (5)), and this 
result is independent of the choice of Q > 0, R > 0. 


Step B: 


Compute 



E 


» U i 


2 



Vi: q.. > 0 


using (11), (12). If a. E V . = 1 V i : q . > 0 and if 

-2 2 11 ' 
y.. ^» u i — ^ V i = 1, ... m, STOP. The COVLQG solution has been 

Discussion of Step B: In the COVLQG option all necessary control effort Is 

2 2 

applied to force the constraints Ejf^ < to be binding. A formal proof 

the stopping criterion of Step B indicates a solution of the COVLQG problem 
given by Theorem 5 of [7]. 


found. 

that 

is 


Step C: Q and R update equations: Let the iteration index be j and set 

q 1 (J+l) = Oj" 2 E oe v. 2 3q i (j) , 1 = 1 k. If (coj 2 )" 1 < q 1 (j+1) < eoT 2 , 

(e < 0 small specified constant) then set q^(j+l) n 0. If 
o i~ 2 E^y.. 2 ■ 1 V 1: q^ > 0, then set r^(j+l) = [y^Eju 2 ]^ 2 ^^), V i: 
y ^” 2 EjiS < 1. For all other i, set r^(j+l) s r^(j). Return to Step B. 

Discussion of Step C: The r.,(j+l) of Step C are clearly adjusted toward the 

' -2 2 

stopping condition of Step B (y^ Ejj^ >_ 1), since a reduction in r^ causes 

Eju* 2 to Increase. The justification for setting q^ = 0 when either q^(j+l) -*■ 0 
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or when q^j+l) -*•<*> is as follows: The tendency of q.. toward zero indicates a 

lack of output controllability due to a degenerate rank of C (rank C < k). In 
this case, the algorithm ceases to attempt the impossible (i.e. to force two 
dependent outputs to arbitrary values) by removing this particular y^ (the least 

critical one as indicated by the smallest q^ -*■ 0) from the cost function by 

setting its coefficient q. ~ 0. Now let rank C = k. The tendency of q. toward 

°° can result only when a stabil izable, detectable system is not output controll- 
able, (even though C = k) and an uncontrollable output converges to a value which 

2 2 

violates its constraint (E jf , , > ). The constraint is violated the smallest 

amount possible since in this case the corresponding q. -*■ <» on successive 

iterations of the update equations. When this condition is determined, such 
y_.'s are removed from the cost function on future iterations (by setting q^ = 0) 

since it now has been established that they cannot be brought within specifica- 

tion.Ej^ 2 < a. 2 . 

A similar algorithm exists for the Constrained Input Variance LQ6 problem 
(CIVLQG) and details are given in [7]. 


V. THE COVSAS ALGORITHM 

The sensor/actuator effectiveness formulas (13), (14) derived in Section III 
and the COVLQG algorithm of Section IV are now integrated to solve the COVSAS 
problem posed in Section'll. 

COVSAS Algorithm : 

- - 2 2 

Step 1. Specify {A,B,C,W,V,*,,m,a ,v >. Run COVLQG algorithm using l 
actuators t m sensors. 

Step 2. Compute y s ® ns f from (13), (14) and rank sensors and 

actuators according to their effectiveness: 


(15a) 


sens 




sens 




sens 


(15b) Vj act > t/ 2 act > ... > V m act 

Delete the sensor and actuator with the lowest effectiveness values 

c AWf 

(/. , I/. , provided such deletion does not cause loss of 
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controllabxlity or observability . * Unless l < £ + 1, reset i xo SL- 

- -? 2 
Inless m < m + 1, reset m to m-1. If o. £ y. =1 V 1 = 1, ...» 

£ 1 1 

and Vi: y. 2 E U . 2 > 1, •£/ [■— 7 y. 2 E u . 2 ]/ • , , \ .. < 

i » i ’ J l £ .£, m i « i J (j+l)iteration 

1 i _ 2 2 

[ T y i E » u i ] (j+l)iteration retw ’ n Ste ? L Otherwise STOP, 
solution to the COVSAS has been found. 


1 . 

k 


A 


Piscussion of Step 2: Nunerica. experience wizh this algorithm suggests that 

more than one sensor and more than one actuator may be deleted on each iteration. 
In fact, tor many cases the sane result can be obtained by reducing t to I and 
m to m on the first iteration. However, this quicker convergence can sometimes 
converge only to suboptimal answers, and the algorithm above is written in its 
most conservative form (deleting only one sensor and/or actuator per iteration) 
where convergence to optimal values is more reliable [7]. 


VI. CONTROL OF A SPACE ANTENNA 

Fig. 1 depicts the Hoop-Column Antenna arrangement for a proposed NASA 
communications satellite. Stationed in a geosynchronous orbit, the objective of 
the antenna control system is to regulate the orientation and focus of the 
satellite antenna relative to its multiple feed horns (at node 10). Table 1 
lists the 24 linear and angular displacements which make up the outputs y. , 

i = 1, .... k, where k = 24. Table 2 lists the 39 admissible sensors and Table 
3 lists the 12 admissible actuators. Note that ARX2 stands for angular rate 
about the x axis at node 2. AX2 stands for angular displacement about axis x at 
node 2. Z10-Z2 stands for a rectilinear displacement between nodes 10 and 2 in 

the z direction. The specifications for the outputs are 0 ^ = 22.8 are seconds 

for i = 1, .... 6, and 0 ^ = .158 mm for i = 7 24. The specifications for 

the inputs u. are y . = 10 dn-cm, i = 1, .... 12. The actuator noise is described 

ii 2 

by W = diag [... ...], = .1 (dy-cm) , V i = 1, ... 12. The sensor noise 

is V = diag [... ...], = 7.615xl0" 7 rad 2 , i = 1,2,3,13,14,15, - 

2.bxl0' 7 m 2 , i = 4, ...» 12, 16, .... 27, V^ * 4.76xl0~ 5 (rad/sec) 2 , i ■ 28, 

..., 39. It is des: ted to limit the number of actuators to 6 » m and the number 
of sensors to 12 = I. The dynamics of the antenna structure were described by 
10 elastic modes and 3 rigid body modes. The square of the frequencies 


^Observability, controllability checks are particularly simple for flexible 
space structures using the tests in [8]. That is, rank tests of matrices 

[B; AB, ... A n_1 B], [C T , A T C T , ... A Tn_1 C T ] can be avoided. 
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2 

id.. , i = 1, .... 10 of the elastic nodes are 

(oij 2 , uj 2 2 , .... a> 10 2 ) = (.40579, 7.2090, 7.2362, 13.2?/, 
44.834, 132.14, 14?. 66, 445.01, 448.69, 775.86) (rad/sec) 2 . 


More complete information for the antenna model nay be found in [7]. 

The results of the COVSAS algorithm applied to the Hoop-Column Antenna are 
summarized in Table 4. The 6 actuators deleted from the admissible set of 
Table 3 are (listed in order of deletion): u g> u g» u iq* u 7* u 4 # ^ 

sensors deleted (in order of deletion) are: z^g, z 3 , Zg, Zj 2 , z, Zj 3 , z 2 , Zj, 

z 24 , z 27 , z 4 , z 5 , z 18 , z 21 , z 3Q , z 3g , z 33 , z 7 , Zg, z 31 , z 23 , k. 20 , z 35 , z 25 , z 22 , 

z^g. Notice that even though the output constraints are still binding the total 

control effort is less using only 6 actuators, (6x5.021 ■ 30.12) than using 12 
actuators (12x3.275 = 39.30 > 30.12). Thus, better performance is possible with 

fewer actuators, since for several actuators the noise effect P. Vi is greater 

than the signal effect V. u in (14) (note thw- negative values of i^. act in Table 

4). 


Perhaps the most important information from t^® C^’^AS is the determine* 
of the minimum achievable actuator specification ' r from Table 5 that all 
of the 24 outputs are held within their design constiainu (a* = 22.8 are secs. 

for angles and a.. = .158 mm for rectilinear displacements) by actuators which 

must be design*, i for the capabilities of TABLE 5. That is, the given output 

specifications, of are possible to meet if is changed (=> actuators are 

redesigned) (from Table 5) to Uj = 73, y 2 = 26, y 3 * 105, y 4 = 26, y g * 32, 

u 6 = 39 * 


VII. CONCLUSIONS 

Presented is an algorithm COVSAS which Integrates the following tasks: 

Selects sensors and actuators from an admissible set. 

Designs a linear feedback controller which satisfies output variance 
constraints. 

Determines actuator design requirements which allow the output variance 
constraints to be satisfied. 
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Numerical properties of the convergence of this algorithm are given for NASA's 
Hoop-Column Antenna. Additional theoretical properties of convergence of this 
algorithm are given in [7], 
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Figure 1: Hoop Column Antenna 
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Table 1: Hoop Column Output Description 

Output # Type Nodal Location Direction 


1 

Inertial Angle 


2 


X 

2 


II 


2 


Y 

3 


II 


2 


Z 

4 

Rel-ative Angle Between 10 

and 2 


X 

5 


II 


M 


Y 

6 


Inertial Angle 

10 


Z 

7 

Relative Linear Disp. 

Between 6 

and 2 


X 

8 


11 


It 


Y 

9 


II 

9 

and 2 


X 

10 


H 


N 


Y 

11 


II 

10 

and 2 


X 

12 


n 


N 


Y 

13 


it 

101 

and 10 


X 

14 


it 


H 


Y 

15 


ii 


II 


Z 

16 


n 

107 

and 10 


X 

17 


ii 


II 


Y 

18 


ii 


II 


Z 

19 


ii 

113 

and 10 


X 

20 


it 


li 


Y 

21 


ii 


H 


Z 

22 


n 

119 

and 10 


X 

23 


it 


II 


V 

i 

24 


it 


II 


z 


Table 2: Hoop- 

Column Sensor Labels 



Sensor 


Sensor 


Sensor 



Number 

Label 

Number 

Label 

Number 

Label 


1 

AX2 

14 

AY 10 

27 

Z119-Z10 

2 

AY2 

15 

AZ10 

28 

ARX2 


3 

AZ2 

16 

X101-X10 

29 

ARY2 


4 

;;6-x2 

17 

Y101-Y10 

30 

ARZ2 


5 

Y6-Y2 

18 

Z101-Z10 

31 

ARX6 


6 

Z6-Z2 

19 

X107-X10 

32 

ARY6 


7 

X9-X2 

20 

Y107-Y10 

33 

ARZ6 


8 

Y9-Y2 

21 

Z107-Z10 

34 

ARX9 


9 

Z9-Z2 

22 

X113-X10 

35 

ARY9 


10 

X10-X2 

23 

Y113-Y10 

36 

ARZ9 


11 

Y10-Y2 

24 

Z113-Z1Q 

37 

ARX10 


12 

Z10-Z2 

25 

X119-X10 

38 

ARY10 


13 

AX10 

26 

Y119-Y10 

39 

ARZ10 
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Table 3: Hoop Column Actuator Description 

Actuator torque abott 

axis at 
Node location 

Uj = T X 2 
u ? = T Y 2 

u 3 s T Z 2 
u 4 = T X 6 
u 5 = TYb 
u 6 = JZ 6 
u ? = T X 9 
u 8 = TY9 
Ug - T Z 9 
u 10= TX10 
u n = T Y 10 
u 12 = T Z 10 
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Table 4: Hoop Column Output Constrained COVSAS Results 


Iteration 

Number 

Identified 
Sensors (l£ en ) 

Identified 

Actuators 

d^ ct ) 

Ave Input 
Value 
(7.6) 

Number of 
Sensors/Actuators 

1 

AZ10(. 00041 16) 
AZ2(. 000397) 
Z6-Z2(0) 
Z9-Z2(0) 
Z10-Z2(0) 

TZ10(-1.362) 
TZ9(— 1.369) 

3.275 

39/12 

2 

AY1(.0C3362) 
AX10(. 003358) 
AY2(. 00226) 

AX2(. 00226) 
Z113-Z10(. 001942) 
Z119-Z10(. 001884) 

TZ6(-2. 1405) 

3.592 

34/10 

3 

X6-X2(. 01457) 
Y6-Y2(. 01455) 
ZlOl-ZlO(.OllO) 
Z1C7-Z10(.0108) 

TX10( -1.2055) 

3.699 

28/9 

4 

ARZ2(. 02844 
ARZ10(. 02232) 
ARZ6(. 02238) 

TX9( -1.2917) 

3.997 

24/8 

5 

X9-X2(.0986) 

Y9-Y2(.0839) 

TX6(-1.4793) 

4.377 

21/7 

6 

ARX6(. 07648) 
ARX2(. 07648) 

— 

C.829 

19/6 

7 

Y107-Y10(. 13395) 
XRY9(.1098) 

— 

4.857 

17/6 

8 

XI 19-X10( . 1557 ) 
XI 13-X10( . 1555) 
X101-X10( .1551) 


4.905 

15/6 

9 

.... 



5.021 

12/6 
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Table 5: Output-constrained Specifications 


E u. 


Output # 

E v. 2 

** l 

Actuator # 

W | 

(minimum achievabl 

1(AX2) 

.015 sec 

1 TX2 

72.91 dn-cm 

2(AY2) 

.015 sec 

2 TY2 

26.145 dn-cm 

3 ( AZ2 ) 

11.588 sec 

3 TZ2 

105.47 dn-cm 

4(AX10-AX2) 

.001 sec 

4 TY6 

26.138 dn-cm 

5(AY10-AY2) 

.001 sec 

5 TY9 

31.750 dn-cm 

6 ( AZ 10) 

12.000 sec 

6 TY10 

38.812 dn-cm 

7(X6-X2) 

.010 ran 



8( Y6-Y2) 

.010 ran 



9(X9-X2) 

.068 ran 



10(Y9-Y2) 

.068 mn 



11(X10-X2) 

.158 ran 



12( Y10-Y2 ) 

.158 ran 



13 ( X101-X10) 

.104 ran 



14( Y101-Y10) 

.158 ran 



15fZ101-Z10) 

.007 ran 



16(X107-X10) 

.158 mn 



17 ( Y107-Y10 ) 

.156 ran 



18( Z107-Z10) 

.008 ran 



19 ( X113-X10) 

. 122 ran 



20( Y113-Y10) 

.158 ran 



21 ( Z1 13-Z10) 

.001 ran 



22(X119-X10) 

.158 ran 



23 ( Y1 19-Y10) 

.091 ran 



24(Z119-Z10) 

.001 ran 
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Buffalo, NY 14260 


ABSTRACT 

A pole placement algorithm is proposed which uses constrained non-linear 
optimization techniques on a finite dimensional model of a linear n degree of 
freedom system. Low order feedback control is assumed where r poles may be 
assigned; r being the rank of the sensor coefficient matrix. It is shown that by 
combining feedback control theory methods with optimization techniques, one can 
ensure the stability characteristics of a system, and can alter its transient 
response. 


INTRODUCTION 

One common method of approaching the problems of controlling the vibration 
cf a structure is o employ eigenvalue (pole) placement methods. Such solutions 
have attracted the attention of numerous authors over the past twenty-five years, 
including W. M. Wonham [6], E. J. Davison [3], S. Srinathkumar [5l» A. 5. Andry 
et al [l], [2] and many others. 

In exploring pole placement in dynamical systems, an inadequacy of stability 
considerations in contemporary algorithms was noted and thus motivated this work. 
It appears that the problem has not been solved or even addressed in many 
approaches . 

If a system is controllable, one has the ability to place a predetermined 
number of poles. Thus, when pole placement techniques are employed, there is a 
limit on the number of poles that may be assigned. As is well known, the rank of 
the sensor coefficient matrix determines how many poles may be placed exactly. 
These poles may be noted as the controllable eigenvalues of the system, while the 
remaining may be labelled uncontrollable . 
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••Research supported by AFOSR Grant Number AFOSR 82-021+2 
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Thus, due to restrictions inherent to every system, every pole may not be 
desirably placed. Therefore, one does not have control over the full order of 
the system. When moving the allowable eigenvalues, those which are not placed 
will also be affected, with the possibility of generating an unstable state. 

Since an unstable system is undesirable, the ability to place a pre- 
determined number of poles, while forcing the system to remain stable would be 
quite desirable to the designer. Many pole placement methods yield satisfac- 
tory assignment of the desired modes, but unfortunately can drive the remaining 
eigenvalues unstable. Thus, requiring iteration of the algorithms, compromising 
the desired choice of eigenvalues or eigenvectors, until a stable response 
results. With the large number of modes required in modelling flexible struc- 
tures, these methods become costly and time consuming. 

Hence, a pole placement method is proposed which constrains the unspecified 
modes to be stable by taking advantage of constrained optimization techniques. 

It appears that no previous work has guaranteed stable unplaced poles or has 
assured the magnitude of relative stability. 

Several numerical examples will be presented, and results will be compared 
with those of Srinathkumar [ 5 ]. 


PROPOSED SOLUTION 

The systems studied in this paper are of the mechanical type, which are 
second order by nature, incorporating mass, stiffness and damping parameters, 
where only the class of discrete systems shall be investigated. 

Assuming small motions about the equilibrium point implies linearization 
of the equations of motion, which become 

[M]q(t) + [D+G]i(t) + [S+H]a(t) - F(t) (1) 

The forcing function vector, F(t), may then be described as 

F(t) = [V]q(t) + [ P ] q( t ) , 

where [V] and [P] are the velocity and position feedback matrices, respectively. 
q(t) is the coordinate ■vector, while £(t) and q(t) are the first ana second time 
derivatives of this vector. 

[M] is known as the mass or inertia matrix, [D] is called the damping matrix, 
and [s] is the stiffness matrix. The matrix [G] may be referred to as the gyro- 
scopic or Coriolos matrix, and [H] is the circulatory matrix. 

The [M] , [D], [S], [g] and [H] matrices are assumed to be time-invariant, 
and therefore are represented by constant values, all be^'ng of nth order, where 
n represents the number of degrees of freedom of the sysxem. 

Using normal state space methods by letting 
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More simply, equation (2) may be expressed as follows: 
x(t) = [A']x(t) + [B]u(t), x(0) ~ x^. 

Z(t) = [C]x(t) 
where u(t) = [K]^(t) 

y.(t) is the output vector, [CJ is a constant sensor coefficient matrix, and [K] 
is the feedback gain matrix. [B 1 may now be described as the constant coeffi- 
cient matrix of actuator dynamics, and u(t) is the control vector. The following 
conditions hold: 

i) xeR^ n , ueR m , £eR r 

ii) A', B, C are real, constat matrices of appropriate dimensions, 

iii) rank B = m ? 0, rank C = r # 0 


By block diagram representation, the system described by equation (3) may be 
expressed as in Figure 1. 



FIGURE I 
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And a more revealing representation is shown in Figure 2. 



FIGURE 2 


Equation (2) may be rewritten as follows: 


x(t) = 

-M _1 ( D+G ) 1 -M 1 (S+H) 

x(t) + 

h 


I ! 0 

n i 


3_ 


[kHc^ | c 2 ]x(t) 


or 


c(t) = 


-M' 1 (D+G) 

-M -1 ( S+H ) 

x(t) + 

B 1 KC 1 | 

I 

0 

B 0 KC, 1 B^KC_ 

L n i J 


2 li 2 2 _ 


x(t) 


Z(t) - [C 1( l Cg]x(t) 

By comparison of equations (l) and (3), one may note that this implies: 

lB 2 ] = [0], 


(3) 


thus 


x(t) = 



-M -1 ( D+G ) ! -M _1 ( S+H ) 

x(t ) + 

B i KC i 

H C _2 


! 


0 

0 


x(t) 


Z(t) = [C^ jc 2 ]x(t) 


( 4 ) 


If we define 


[A] = 


-M -1 (D+G) + B 1 KC 1 


-M _1 (S+H) + B 1 KC 
0 
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and describe equation (4) as follows: 

x = [A]x 
Z = [C]x 

Then, the set of equations must satisfy the eigenvalue problem, i.e., 
[‘IVj - 


( 5 ) 


where 


w: 


the 2n eigenvalues 


2n 


and 


jv^j = the corresponding eigenvectors . 
. By substitution of equation (4) into equation (5)» 


C i^i = 


-M _ 1 ( D+G ) { -M _ 1 (S+H) 

; 

T ! o 



B.KC, 

i B, KC- 

-tr 4 . 

+ 

11 

1 2 


0 

|T 

1 


n i 

v. may then be defined to correspond to the above partitioning as follows: 


yielding 

% 

which implies 
substituting, 


— —1 

Z. 

1 


w. 

1 _ 



i 


v, = 


h 

*ri 


-M' X (D+G) j-M -1 (S+H) 
*n ! 0 




h 




Y C 1 ! B 1 KC 2 

' i““ 






= -M" 1 (D+G)C i w 1 - M _1 (S+H)w 1 + + B^KC^ 

If we define { } i = 1,2,. .,,r as the r eigenvalues to be placed, 
equation (6) may be expressed as 

WA 2 = -M _1 (EH-G)WA - M“ 1 (S+H)W + B^C^WA + B^C^W 


( 6 ) 
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where 


j V ] 

i -r 


and A = diag(X ,X„ X ) 

x d r 

By taking advan J age of the generalized left inverse theorem, 

[K] = [ B^B 1 ] _1 [ ] [ WA^+M -1 ( D+G ) WA + M _1 (S+H)w] [C WA+C W] -1 , 

which is the equation describing the gain matrix needed to obtain those eigen- 
values desired. 

A single objective function was then determined from the set of equations 
described by equation (7), where the values of [K] were determined by minimising 
that objective function. The constraints imposed on the system were that the 
real part of the eigenvalues of the closed loop system were all negative. These 
constraints were also modified, as was desired, to inc: ease the stability margin. 

NUMERICAL EXAMPLES 


Example 1: 



FIGURE 3 

Specifications: = 1 
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Eigenvalues of unforced system: 


-1.666207 ± 1.4l334i 
- .333783 t -.832651 


[C] 


O ' 0 0 

_! J 0 0_ 


[B] 


1 0 
0 1 
0 0 
0 0 


Desired eigenvalues: 


X 


1 


X 


2 


-4.0 + Oi 
-3.0 + Oi 


Resulting eigenvalues using the proposed method with no additional factor for 
relative stability: 

-4.000000 + Oi 
-1.335420 + Oi 
- .249608 + Oi 
--. 000000 + Oi 


Resulting eigenvalues using the proposed method with added factor of relative 
stability: 

-4.000000 + Oi 
-2.999999 + Oi 
- .572543 ± 0.743531 


Resulting eigenvalues using Srinathkumar method: 

9.1256 + Oi 
- .8141 + Oi 
-4.0000 + Oi 
-3.0000 + Oi 

Note that the method proposed here yields the desired .eigenvalues and that 
the unspecified eigenvalues remain stable, whereas .ca the- Srinathkumar method 
an unspecified eigenvalue is moved into the right half plane. 
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Example 2: 



Eigenvalues of unforced system: 


-.004055 ± 1.64T953i 
-.170649 ± 1.13l4l8i 
-.062364 ± .3556?4i 

-.075432 ± .73044li 

Tl 0 0 0 0 0 0 ol 

1 J " |_0 0 0 1 0 0 0 OJ 


10 0 
0 10 
0 0 1 
[B] = 000 

0 0 0 
0 0 0 

_° 0 °_. 

Desired eigenvalues: X _ - -.4 ± .5 

1 **- 
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Resulting eigenvalues using the proposed method, where a factor for relative 
stability was added: 


-.28931*2 ± 1.378583i 
-.H*5l*5i ± 1.1713U5i 
-.1*00007 ± .500003i 

-.19781*0 ± .1*25941*1 

Example 3: 



FIGURE 5 


Specifications: = m^ = 1 

S 1 = 3 
S 2 = ! 

Eigenvalues of unforced system: 

±2.07l*313i 
± .8350001 

[C] =[1100] 

~1 0 “ 

w - 2 0 
0 0 


Desired eigenvalue: .5 + 0i 


Resulting < igenvalues using the proposed method, where factor for relative 
stability was added: 


- .170373 ± 1. 8090971 
-1.817157 + 01 

- .500000 + 01 


CONCLUSION 

A pole placement algorithm has been proposed which used constrained non- 
linear programming techniques for a finite dimensional model of a linear n degree 
of freedom system. It has been shewn that by constraining the eigenvalues of the 
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full order system while simultaneously placing those allowable, one can ensure 
the stability characteristics of a system, and can alter its transient response. 

Results of the Srinathkumar method were presented for Example 1, and showed 
how this method yielded the desired eigenvalues quite accurately, yet unfortu- 
nately forced the originally stable system unstable, therefore resulting in an 
undesirable response. 

No previous work has guaranteed stable unplaced poles or has assured the 
magnitude of relative stability. 
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ABSTRACT 

The application of matrix transfer function design techniques to the 
problem of disturbance rejection on a flexible space structure is demonstrated. 
The design approach is based on parameterizing a class of stabilising 
compensators for the plant and formulating the design specifications as a 
constrained minimisation problem in terms of these parameters. The solution 
yields a matrix transfer function representation of the compensator. A state 
space realization of the compensator is constructed to investigate performance 
and stability on the nominal and perturbed models. The application is made to 
the ACOSS (Active Control of Space Structures) optical structure. 


I. INTRODUCTION 


The problem of flexible space structure control has motivated a great 
deal of research for theoreticians and practitioners of multivariable control 
design. In spite of the efforts directed in this area there still remains a 
significant gap between the multivariable theory and the control design 
implementation. This gap stems from two sources. The first difficulty is one 
of problem specification. Translation of complex system* requirements and 
constraints into the specific mathesmtical cost functionals required by most 
design methods may be impossible in many cases. Free parameters in the chosen 
design methodology may not be traceable to the parameters which describe the 
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system in terms of desired performance, plant uncertainty, hardware 
limitations, etc* A second roadblock to the implementation of modern control 
design techniques is the iack of reliable algorithms and software to perform 
the sophisticated mathematical manipulations required by these techniques. 
Recent years have shown very considerable advances in this field (see [1]) but 
much remains to be done. 

Most of the MIMO (multi- input/multi-output) compensators which have 
actually left the textbook and been calculated in computers are based on state 
space methods, and, in particular, LQG ( Linear-Quadrat ic-Guas a ian) design 
theory. This is due in part to the long history of development of these 
design techniques as well as the availability of reliable algorithms to solve 
matrix Riccati equations and the ease of performing most state space 
manipulations. Frequency domain techniques for calculating MIMO feedback 
systems have been avoided. The extensions of classical frequency domain 
concepts to MIMO systems have not been totally satisfying and calculations 
involving matrices of transfer functions present an entirely new set of 
problems. Nonetheless, frequency domain design is still appealing and certain 
feedback notions cannot be adequately expressed without reference to transfer 
functions. 

We have carried through a compensator design for a flexible structure 
based on transfer function parameterization techniques. General theories of 
feedback control system parameterization have been developed by several 
authors ([2], [3], and [4]). The goal of a parametric approach is tne 
selection of a set of numerical quantities, along with an acceptable range of 
values, which span a class of possibly acceptable compensators and, wi - 
which, one is able to adequately express the system requirements in terms of 
cos 1 8 and constraints. A particularly simple parameterization for stable 
plants was introduced by Zanies, [4], and exploited for the unity feedback 
conf iguration of Figure l bv Desoer and Chen [5]. This is the 
parameterization we will implement here. The details are in section IV. 
Previous examples of this design approach can be found in [6] and [7]. 
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II. ACOSS STRUCTURE 


The ACOSS optical structure was developed by the Charles Stark Draper 
Laboratories as a control design test specimen to evaluate the design 
approaches developed for the DARPA ACOSS program, f 8] • It was designed to 
exhibit the closely spaced, low frequency mode distribution expected on some 
future space systems. The structure is provided as a finite element model 
having 84 dynamic degrees of freedom (see Figure 2). In addition to the 
nominal structure, two perturbed structures were defined to represent plant 
uncertainty. The perturbed models represent mass and stiffness variations of 
approximately 10Z. The nominal model is denoted PO, the perturbed models are 
P2 and P4. 

The performance goal is expressed in terms of a line of sight error on a 
focal plane on the lower section of the truss as shown in Figure 2* The error 
has two angular components and a defocus component resulting from deviations 
in the optical path due to structural vibrations. Three rigid mirrors 
determine the optical path. These are assumed to be rigidly mounted to the 
structure. Two disturbances are defined ou he structure as shown in the 
figure. For our design problem we are only considering the disturbance 
propagating from the equipment panel and we assume it has a flat PSD out to 
5 Hz. The equipment panel is isolated from the structure by a spring-damper 
system. The residual disturbance propagation through this isolation system 
into the line of sight is still unacceptably high. The control problem is to 
further reduce this residual with active structural control. 


III. MODEL SELECTION AND ACTUATOR PLACEMENT 


For the current design problem we chose a 5 mode model of the structure, 
selecting those modes having most significant influence between disturbance 
and line of sight. The modal influence was determined based on ideas from 
internally balanced coordinates. For a description of internally balanced 
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coordinates see f 9 ] and for an application to modal coordinates see [10l. 
Given a second order model description. 


q. + 2 C.OJ.q. + U). 2 q. 3 gT w i i * i,...,n 

i ill l l a i — * 


(l) 


y = y h .q . 

in 


( 2 ) 


^ith natural damping £ , frequency U) , inputs w, and outputs an index 

i i 

ranking the modes can be calculated as the approximate "second 
order modes," ([10]) by 


cr. 

i 


* —l—i 


) 


4 C-W. 
i i 


(3) 


Using the modal disturbance influence matrix for the g^'a and the line 
◦f sight measurement matrix for the h^'s the 5 highest rank modes are 
tabulated in Table 1. Agreeing with our intuition, these turn out to be two 
isolator rotations, two isolator translations, and the first bending mode of 
the upper truss* A description of the modes of the structure can be found in 
[ 8 ]. 



7 

8 

12 

13 

B 

Frequency (Hz) 

a 

.26 

.58 

.58 

2.3 


Table 1* Design Modes 


The line of sight measurement matrix is a function of 21 nodal degrees 
of freedom, from among these 21 degrees of freedom we chose to locate three 
force actuators (assumed to be of the momentum exchange or proof mass type) to 
control the three line of sight measurements* To make this selection an 
appeal is again made to the approximate second order modes of equation (3). 
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If the forcing function on the right of (l) is g .u where g.. is the 


.th 




*th 




influence of the j actuator, j * 1,...,21 on the i mode, i * l,...,5, 
then we denote the corresponding second order mode by and define 


2 

a. = 

J 


5 

I 

i 3 l 


a. . 

iJ 


( 4 ) 


Here, a. is a measure of the influence of the j** 1 actuator on the 
line of sight for the selected 5 mode model. We chose three actuators whose 
force directions span the three spatial directions and have large a. with 
respect to the total 21 possible actuators.* Two of the actuators selected are 
located on the corners of the primary mirror and the third is on the lower 
truss • 


To complete the description of the design plant we assumed the 
availability of direct measurements of the line of sight. No other sensors 
were used for the control design. We now have a state space description of 
the design plant in modal coordinates, 

x - Fx + Gu + Dd (5) 

y * Hx (6) 


where u is .he actuator command and d is the disturbance input. 

Per calculation of the compensator we need a transfer function 
represents ion of the design plant. The convenient representation for 
constructs g state space realisations of the compensator is a polynomial 
matrix coprime factorisation [11, 12 1 , that is, P * ND ^ where N and D are 
coprime polynomial matrices. An algorithm to construct a coprime 
factorisation from a state space description can be found in [13]. 
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IV. DESIGN PROBLEM 


The feedback configuration used for the design is shown in Figure 1. 

The closed loop system is referred to as P is the open loop design 

plant , a 3x3 transfer function given from the state space equations by 

H(sl-F) *G. The inputs are u^ and u^ with the reference input, u^, 

identically zero. The outputs are and y ^ with the line of sight 

represented by y . The disturbance propagates into the line of sight 
1 ~~ , -I 

through the transfer function P =* H(sl-F) D and may thus be represented as 

*v 

an additive disturbance, Pd, at the plant output. 

The closed loop system transfer function is defined to be 


H 


yu 



(7) 


Stability of H can be taken to be closed loop stability. 
yu 

expressed in a simple parameterized form as 


H may be 
yu 


Q ~QP 

H 

yu 

PQ P(I-QP) 


( 8 ) 


where Q is referred to as the Zames parameterization, [4], with 

Q * C(I + PC)" 1 . 


(9) 


We state here the fundamental result from [5] which is the basis of this design 
approach. 

Fact : For P exponentially stable and strictly proper, Q is exponentially 
stable and proper if and only if 
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(i) C is proper and 

(ii) Hy u is exponentially stable and proper. 

When this is the case the compensator is given by 

C - Q(I - PQ)" 1 (10) 

In other words designing stabilizing compensators for is equivalent to 
specifying exponentially stable, proper Q. 

From (8) we see that the I/O map, that- is, the transfer function from 
U 1 t0 y 2 * s 


H * H = PQ (ll) 

y 2 U l 

Given an invertible plant transfer function, P, one can see from the 
reLation (11) that a parametrization of the closed loop system by Q is 
equivalent to a parameterization by H. Moreover, for P exponentially stable, 

Q exponentially stable implies the same for H. But since 

Q - P~ l H (12) 

it becomes clear that exponential stability of H only implies exponential 
stability of Q when P has no unstable zeros. However, by imposing an 
additional condition on H, namely that H has the same right half plane zero 
structure as P, then parameterization by such H is equivalent to parameter iza- 
tion by exponentially stable Q. If a proper compensator is desired the 
additional constraint of properness of Q is required and will result in an 
excess pole over zero constraint on H which depends on P. 

Parameterization by the I/O map, H, may simplify the design problem and 
allow the designer to more directly specify his design objective. For 
example, for a disturbance attenuation problem, the closed loop disturbance to 
output map, or sensitivity map, is simply given as (I - H). In addition, in 
some applications, a decoupled I/O map is desirable and one is directly able 
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to parameterize a diagonal H. This is the approach we take for this design. 
Calculating the transmission zeros of cur design plant using the QZ algorithm 
[14] we find that there are no zeros in the right half plane so we may freely 
specify H as diag(h^, i=l,2,3) with each h^ of the form 



8 P„<»> 


UY 


(s) 


(13) 


where g is a gain and 


2 2 

p (s) 3 s + 2J tua + w (14) 

n n n * n 

p (s) ■ s 2 + 2£ (0 s + 0) 2 (15) 

a . a . a * a . 

J J J 3 

This parameterization has 21 parameters consisting of the gains , and 
second order damping and frequency terms* 

We set for ourselves a design goal of minimizing closed loop response to 
the disturbance over a low frequency band of 5 Hz* To achieve this we define 
a constrained optimization problem as follows: 

Minimize 


subject to 


J - 11(1 - H(jU))) P(j0))ll 2 , 0) - 10TT 


(16) 


0.01 < C . C 

J 


.‘Stability 


0.04 < 0) * w. < U). bandwidth 
n d, b 
J 


h. (0) * 1 

l 


:Low frequency rcise rejection 
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The matrix under the norm of J is diagonal so we aimj^v take the 
Euclidean vector norm of the diagonal. The minimization o* ne cost at 5 Hz 
and the DC unity gain constraint will result in disturbance rejection across 
the 5 Hz band* The P term in the cost weights the diagonal terms in (I - H) 
according to the way the disturbance propagates through the structure. 

In general (I - H) is the ratio of the relative uncertainty in the I/O 
map to the relative uncertainty in the open loop plant. More nreciseiv 

(AH)H* 1 - (I~H)(AP)P” 1 (17) 


where 


AP » P - P 

(ltJ7 

AH - H - H 

(19) 


for a "perturbed" plant P which results in a perturbed I/O map H. In effect, 
minimization of J reduces the impact of plant uncertainties on closed loop 
system performance. 


Having specified the optimization problem one can use numerical or 
analytical means to solve it. Omitting the details, we calculated a local 
minimum to this problem analytically. The achievable performance is clearly 
dependent on the bandwidth, 0)^, For a 
satisfies 


U). 

- 

(20) 

d. 

J 

b 



- p (s) 
d 2 

(21) 

2C 

0) 


n 

r* 

(2^ 

C d. 

J 

V 

j 



given <4 the local minimum 
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4 



0 ) 

n 


Given this solution we can adjust the bandwidth of each of the three 
loops to achieve a desired performance level. To achieve 0.04 reduction in 
each channel we have the following parameter values: 



V 

J 

C d. 

J 

0) 

n 


g 

h l 

300 

1.5 

10 4 . 

100 

81 

h 2 

600 

.8 

4XX0 4 

107 

81 

h 3 

300 

5 

2X10 3 

200 

25 


V. COMPENSAiOR REALIZATION 


Having arrived at parameter values we have specified tne desired I/O 
map, H. The compensator which will produce this l/° mnp is 


C - Q(I - PQ) 


(24) 


* p“Hi(l - H)" 1 . 


(25) 


Since the I/O map is given by diag(h^ ) with each h^ of the form 


h. 

i 


n. 

l 


d. 

l 


» 


( 26 ) 
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the compensator becomes 


C 


P *diag 



(27) 


We have already expressed P as a polynomial matrix coprime factorization, 
P - ND Thus (27) becomes 


C * 


D N l diag 



(28) 


Since the degree of d^ - n^ is 4, we can factor this polynomial into two 
quadratics as 


d. - n. « d. d. , i - 1,2,3. 

ii i L i 2 


Hence (29) can be rewritten as 


C 


D { diag(d. )N } * 



(29) 


(30) 


By inspection, diag(d. )N is column- reduced [12], and has column degrees 

H - . 

equaling those of D. Consequently D {diag(d. )H} is proper and has a state 

l l 

space realization [12, Sec. 6.4]. Now, since diag(n./d. ) also has a state 

1 V 

space realization, the two realizations can be cascaded to yield a realization 
for C. 
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VI. RESULTS 


Having computed a state space description of tho compensator we are now 
able to determine closed loop stability for various versions of the plant 
simply by extracting the eigenvalues from the closed loop state equations 
derived from Figure 1. We find that for all three versions of the plant (PO, 
P2, and P4) the five mode description remains stable under feedback by our 
compensator. 


To investigate the robustness of the design with respect to unmodeled 
dynamics we appended additional modes to the plant model and found that the 
closed loop system became unstable in almost all cases. Upon investigation of 
this problem we discovered that though the 5 mode design plant was minimum 
phase, the addition of almost any other mode or set of modes resulted in a 
nonminimum phase plant. Information about these unstable zeros was not 
available in the design plant so the resulting compensator tended to place 
closed loop poles at these zeros. Thus the stability problem experienced is 
one of modeling or model reduction. In general, any control design approach 
must have information about the righ*^ h*lf plane zeros of the plant. 


The performance of the closed loop system remained very consistent with 
the predictions made during the design stage. The steady state RSS response 
at 5 hz of the two angular components of the line of sight is given as a 
fraction of open loop response for the three models by: 


4 


PO 

P2 

P4 

-2 

-2 

-2 

4.3 X 10 

5.1 X 10 4 

4.6 X 10 


The broadband disturbance attenuation is illustrated on the Bode plots 
of Figures 3 and 4 which compare open and closed loop response. Across a 
significant portion of th* 5 Hz band the performance improvement is 3 to 4 
orders of magnitude. 
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VII. CONCLUSIONS 


We have demonstrated the applicability of a transfer function parameteri- 
zation design approach for problems of broadband disturbance attenuation on 
flexible space structures. This methodology provides the control designer 
with a great deal of flexibility to meet system requirements by the choice of 
parameter set and selection of cost function and constraints. Although the 
implementation of this technique requires difficult numerical calculations 
involving matrix transfer functions, algorithms and software for these types 
of problems are already emerging. The success of this approach is dependent 
on an appropriate parameter selection in which to express the problem 
specifications. This suggests research, probably application specific, which 
addresses the issues of problem description and requirements interpretation in 
the control design process. 
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INTRODUCTION 

The control design problem for the class of future spacecraft referred to as 
large space structures (LSS) is by now well known [1-31. The issue is the re- 
duced order control of a very high-order, lightly damped system with uncertain 
system parameters, particularly in the high frequency modes. This paper pre- 
sents a design methodology which incorporates robustness considerations as 
part of the design process. Combining pertinent results from multivariable 
systems theory and optimal control and estimation, LOG eigenstructure assign- 
ment [4] and LQG frequency-shaping, [5-7] were used to improve singular value 
robustness measures in the presence of control and observation spillover. 

The design technique is summarized as follows. A low order LQG compensator is 
synthesized using the technique of recursive eigenstructure assignment to 
place closed-loop eigenvalues where desired. This design is evaluated for 
singular value performance margin and for singular value gain margin with 
respect to plant uncertainties (e.g., modeled dynamics). The compensator is 
then resynthesized using frequency-shaping concepts to improve the singular 
value robustness measures. The recursive eigenstructure assignment technique 
allows regulator close-loop eigenvalue placement at the desired locations for 
the plant and as required for frequency-shaping. Furthermore, the frequency- 
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shaped compensator eigenvalues can also be assigned, thus assuring LOG com- 
pensator stability, as well as estimator stability. 

This procedure using robust frequency-shaped compensation was applied to the 
design of the controller for a representative large space structure. Results 
are presented as singular value Bode i.!ots. Comparisons are made to a recent 
study9 utilizing the same large space structure model. 


LQG CONTROL DESIGN FOR LSS 

Control design plant modelling for LSS utilizes a high-order structural model, 
typically obtained by finite- element programs such as NASTRAN. The limita- 
tions of computer implementation require that the finite-element model be 
reduced to a design model. One approach is to truncate the high-order model 
into primary and residual modes, where the primary modes are to be used for 
control design. The modal truncation can be based on engineering judgement or 
on a selection criterion such as modal cost analysis [93. 

The system model has the form 

x p * Ap Xp + BpU 
• 

X R ” Ar xr + Bru (1) 

y - Cp Xp ♦ Cr xr 
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where x p are the primary modes and xr are the residual modes. An ob- 
server-based control design for v he primary modes then has the form 

X P » Ap x p + B p u + G(y - C p 5p) (2) 

A 

u = -KXp 

Using LQG design, the gains (K, G) are selected to minimize quadratic perform- 
ance indices. The terms Br u and Cr xr were identified by Balas [33 as control 
spillover and observation spillover respectively. These terms have the poten- 
tial fur interacting through the observer (2) to produce instability. 

LQG theory guarantees that the reduced-order closed loop system is stable with 

eigenvalues of (A p -B p K) and (A p -GC p ). However, no such guarantee holds for 
the compensator, 

u *= Hy (3) 

which has the eigenvalues of (A p -B p K-GC p ) . This fact can be fatal for LSS 
reduced-order control, unless measures are taken to ensure system robustness. 

ROBUSTNESS MEASURES FOR LSS 

For multivariable feedback systems the emerging singular vali *> robustness 
theory can be used to develop measures for stability and performance. Kosut, 
et al,8 applied this theory to the large space structure control design prob- 
lem, treating the residual dynamics as a perturbation. For a system with a 
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stable nominal feedback system (based on the reduced model) and stable pertur- 
bations (due to the residual dynamics), sufficient conditions for stability 
are obtained when the singular value stability measures exceed the maximum 
perturbation due to model uncertainty. Fig. 1 defines the terminology for a 
large space structure control system. For an additive perturbation, cig. 2a, 
the sufficient conditions for stability are 


SMi = ^ [I + H(jto) G c (ym)] > ^ [H(jo>) Gr (jto)] 
3M 2 = o [I + G c (jo>) H(ju)] > o [Gr (jto) H(jo,)] 


(4) 


where o (•) indicates the maximum singular value and o_ (•) indicates the mini- 
mum singular value. (Singular values of the complex matrix A are the positive 
square roots of the eigenvalues of A*A, where (•)* indie n os conjugate 
tran-spose.) If G c (s) is minimum phase and invertible, a multiplicative per- 
turbation can be formed, Fig. 2b, and the sufficient conditions for stability 
are then 


SM^ = _o [I + (HG C )~1] > o [Gq-^Gr] 


(5) 


S.1 ? = £ [I + (G C H) -1 ] > o [GrGc' 1 ] 
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where the ju arguments have been supressed. Good performance within the oper- 
ating frequency region (i.e., the "control bandwidth") is provided when the 
performance measure 

PM « o [I ♦ G c h] (5) 

is large. The stability measures (i») are generalizations of Nyquist polar 
plot analysis; the measures (5) are generalizations of Nyquist inverse polar 
plot analysis. The need for large performance measure (6) is a generalization 
of the desirability of large loop gains. 

ROBUST COMPENSATION DESIGN 

The stability and performance measures presented above require stac, lity of 
the nominal feedback system. In a previous work £*1], the authors presented a 
recursive design procedure which assigns the closed-loop eigenstructure in 
linear quadratic regulators. At each stage, the required solution for the 
steady state hiccati matrix which shifts a pole or po A e pair to specified 
values is obtained. For pole pair placement, a free parameter in the solution 
permits selection of closed-loop eigenvectors. This design procedure is sum- 
marized in Appendix 1. 

Using duality, the procedure also applies to estimator design. By extension, 
the procedure can be used to design stable compensators by considering the 
closed-loop regulator dynamics matrix (A-BK) as the open-loop system and pick- 
ing the estimator gain to place the compensator eigenstructure of (A-BK-GC). 
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Compensator robustness can be enhanced through the use of frequency-shaped 
control and estimation [5,6]. In frequency-shaped estimation, a fre- 
quency-domain perf ormance index is considered, 


1 00 

J = E — / [w'Q(ju)w + v' R(jw)v] du> (7) 

2n -* 

where w is the disturbance and v is the sensor noise. Sensor r.oise fre- 
quency-shaping is re'll ized by treating v as an auto correlates noise source of 
the form 


v(j w) 


= R 



u) V(ju) 


(3) 


where v'(ju) is a white noise process. In the approach used here, Q(jui) is 
determined by pole placement, equivalent to injecting fictitious process 
noisj. Rl/2(j u) ) must be proper (not strictly proper) to maintain sensor noise 
weighting over the entire spectrum. Then define a pseudo-measurement 

z' - R-1/2 (j u ) z - R-1/2(J U ) Cx(Jid) ♦ v' (ju) (9) 

R" 1// 2 (ju) can be realized in state space as 

J 

Xy ■= Ay Xy + ByCX 

y * Cy Xy + DyCX (10) 

2’ * CyXy + DyCx + V* 
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This dynamic model is appended to the system dynamics to form the frequency- 
shaped estimator, 

x * Ax + G(z’ - C V X V - D v Cx) + B c u 

( 11 ) 

4 

Xy * AyXy + ByCX + Gy (Z* “ CyXy — DyCx) 


where z' is obtained from (10). The gains G and G v t jn be kicked to plav'. the 
eigenvalues of (11) at those of the frequency-shaping filter (10) and the oth- 
ers as required for performance. A dual result can be used to develop fre- 
quency-shaped gains for the regulator. 

Because frequency-shaping ada... states to the compensator, an efficient choice 
of the loops to be shaped Is desirable. Kim [7] has developed a procedure for 
loop selection based on the singular vectors of the return ratio matrices G C H 
or HG C . Me conjectured that an input vector y in the direction of qj , the 
singular vector corresponding to "oik) will get the largest amplification by A. 
Similarly, a vector in the direction of q n , the singular vector corresponding 
to _o(A) will get smallest amplification. Therefore, if the component of y In 
the direction which is closest to qi , is reduced by a filter before it enters 
A, ~(A) is effectively reduced. a( A) increased by increasing the compos 
of y closet tc q n before it enters A. It > -n be shown that frequency-shaping 
introduces transmission zeros into the compensator transfer function. 
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DESIGN METHODOLOGY 


The discussion which has been presented above suggests the following design 
methodology : 

. Compensator design for performance of the reduced order system. 

2. Eva* .ation of the stability margins (4,5) against the perturbation due to 
the residual dynamics. 

3. Selection of frequency-shaping filters to enhance stability robustness. 

4. Synthesis c: . equency-shaped compensator to incorporate performance and 

stability margins. 

The recursive eiger.structure design algorithm can bo used for the designs. 


EXAMPLE. 


Tne design methodology was applied to a control design for the ACOSS-1 model, 
albj u'.ed in the comparison study [9]. The model is illustrated and the 
state-space data are listed in Appendix 2. As in the comparison study *'he 
first eight structural modes were retained. A regulator was designed with 
ciosed-loop poles at 20$ damping; a compensator was designed with poles at 
ritical damping. Fig. 3 illustrates stability measure (5) for the loop 
broKen at the ouunut. Performance s adequate at low frequencies but stabil- 
ity robustne:- is inadequate above 1 Hz. 


70 



To improve stability robustness, frequency-shapea estimation was incorporated 
in all three output loops using second-order low-pass filters. Fig. 4 illus- 
trates the recovery of stability robustness while still retaining good low 
frequency performance, Fig. 5. 


DISCUSSION 


In the comparative study by Kosut, et al [8], both LOG modal control and a 
frequency-shaped control were investigated (along with others). LOG control 
was found to have poor performance as well as poor stability robustness. Fre- 
quency-shaped control was found to have adequate stability robustness, but 
poor low frequency performance. 

The methodology presented here addresses both of these issues. Performance is 
achieved by pole placement design of the compensator, achieving good loop 
gains at low frequency. Stability robustness is achieved by adding fre- 
quency-shaping without sacrificing low frequency performance, since the gain 
of the frequency-shaping filters is one at low frequencies. 


CONCLUSIONS 


A design meth dology for control systems for large space structures has been 
proposed which incorporates both performance and stability robustness concerns 
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as an integral part of the design process. Performance was achieved by plac- 
ing the poles of the compensator. Stability robustness was achieved by fre- 
quency-shaping the compensator to satisfy a frequency domain stability 
robustness test. 

An example was pr isented which applied the methodolgy to a system with the 
loop broken at the output. A full design study would also require examination 
of the system with the loop broken at the input, using regulator fre- 
quency-shaping to enhance robustness. 
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APPENDIX 1 


Recursive Eigenstructure Design 

The steady-state optimal control law for the linear, time-invariant, 
controllable system: 

x = Ax + Bu (A.1) 

which minimizes the quadratic performance index, 

J = 1/2 lxT q x + U T Ru] dt (A. 2) 

is linear state feedback 

u * Kx = - R'l bT sx (A. 3) 

where S is the solution of the steady-state Riccati equation, 

-SA - AT S + SBR-1 BTS - Q - 0 (A. U) 

In this appendix we summarize an interactive design technique which solves 
(A. 4) to provide specified eigenvalues of the closed-loop system dynamics 
matrix A+BK and which also permits some freedom in selecting closed-loop 
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eigenvectors. The method is reported elsewhere [4] in detail. It extends the 
procedure of Solheim [10] in which, for fixed R, the elements of Q providing 
the required pole placement are calculated directly. 

The design technique is recursive; at each stage, the system dynamics matrix A 
in ( A. 1 ) incorporates previous state feedback. We then implement the follow- 
ing eigenstructure calculation: 

X"1[A - HS]X = A (A. 5) 

where A = T -1 AT is block diagonal, T is the real eigenvector matrix of A, and 
H = T~1 br _1 B t T~ t is symmetric and positive semi-definite. A is identical to A 
except for a block of shifted poles. X is the transformation from open-loop 
eigenvectors to closed-loop eigenvectors; it is defined as the "stage" 
eigenvector matrix. ? is the Riccati matrix in the open-loop diagonalized 
coordinate system; S is chosen to shift a single pole or a pair of poles. The 
corresponding gain matrices, K, de ermined for each stage are subsequently 
added to obtain a final gain which achieves the same closed-loop pole loca- 
tions. 

To provide the required pole shift, the only non-zero elements of S correspond 
to the entries of A which are to be shifted. With this ohoice of S, the char- 
acteristic equation factors into the product of terms for the unshifted poles 
and a term for the desired shifted poles. Thu~, 

f si - a! - D(s) n (s - Ai) (A. 6) 

iel 
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where I is the index set for the unshifted poles, and D(s) contains explicit 
elements of S, H, and the block of A which is to be shifted. Matching the 
coefficients of powers of s in D(s) to the equivalent terms in the closed-loop 
characteristic equation provides a set of equations in the required elements 
of S. For the single pole shift A jj - * to ic, the only non-zero element of S 
satisfies 


For double pole placement it can be shown that the three required elements of 
lie on the intersection of two quadric surface in a mathematical space hav- 
ing the three S elements as coordinates. (It can also shown that a direct 
solution for Q has a similar geometric interpretation.) If the corresponding 
submatrix of H is positive definite, the surfaces are a plane and a hyper- 
boloid of one or two sheets; the inters:;' -Ion, if it exists, is always an el- 
lipse. If the relevant submatrix of L is singular, the surfaces are planes, 
and the intersection is a line. The different points comprising the solution 
all provide the desired eigenvalue placement, but with different eigenvectors. 

In ref. ^ a solution for S is presented which takes advantage of the quadric 
surface geometry to define a free parameter that allots design freedom in the 
choice of closed-loop eigenvectors. The solution for the stage eigenvector X 
partitions into two sets of equations. The first is a homogeneous Lyapunov 
equation for the submatrix corresponding to the shifted pole block in A. For 
> role pair shift, the submatrix is 2x2. Hence, depending upon the nature of 
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the closed-loop poles (real or complex), one or two elements of the submatrix 
may be chosen arbitrarily; the remaining elements then depend :-n the choice of 
elements of S. The other equation is a non-homogeneous Lyapunov equation in 
the remaining elements of the columns of X containing the 2x2 submatrix; its 
solution depends upon the 2x2 submatrix, the elements of S and A, and certain 
elements of H. 

The closed-loop system eigenvector matrix is then Tql - TX. The solution of X 
depends upon S, which varies with the choice of the free parameter. There- 
fore, by recursively shifting pole pairs, design freedom exists to select 
closed-loop eigenvectors while providing required pole placements. 

The procedure outlined above lends itself to a recursive procedure for practi- 
cal multivariable regulator design. The steps in the procedure are as fol- 
lows: 

1. System (A.1) is placed in modal form. 

2. The designer selects the control weighting matrix R, then H is calculated. 

3. The designer selects a real pole or pair of poles to be shifted and their 
desired location; • a pair, he also selects the free parameter which 
determines the closed-loop eigenvectors. 

4. The stage gain is calculated and the closed-loop system is placed in modal 
form. 
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5. Steps 3 and 4 are repeated for other poles until the designer is satis- 
fied. 

6. The total system gain is obtained by adding the stage gains. 

Clearly by duality, the same process can be applied to estimator design, per 
mitting the development of multivariable compensators. 
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APPENDIX 2. 


The ACOSS-1 flexible spacecraft model was developed by the Charles Stark 
Draper Laboratory . 10 it is representative of many radar and optical control 
problems, but is small enough to be tenable for research studies. The struc- 
ture is a tetrahedral truss supported by three right-angle bipeds. The truss 
members are flexible in the axial direction only. The model has 12 modes; for 
control design, only eight are assumed to be -known. 
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Residual Dynamics 
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Ffg. 3 Reduced - Order Control Stability Margin 
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ABSTRACT 


Robustness properties are investigated for two types of controllers for 
large flexible space structures, which use collocated sensors and actuators. 
The first type is <tn attitude controller which uses negative definite feedback 
of neasured attitude and rate, while the second type is a damping enhancement 
controller which uses only velocity (rate) feedback. It is proved that collo- 
cated attitude controllers preserve closed-loop global asyaptotic stability 
when linear actuator/sensor dynaaics satisfying certain phase conditions are 
present, or aonotonic increasing nonlinearities are present. For velocity 

feedback controllers, the global asyaptotic stability is proved under auch 
weaker conditions. In particular, they have 90* phase aargln and can tolerate 
nonlinearities belonging to the [0,») sector in the actuator/ sensor character- 
istics. The results significantly enhance the viability of both types of 

collocated controllers, especially when the available inforaatloo about the 
large space structure (LSS) parameters is inadequate or inaccurate. 


INTRODUCTION 


Large flexible space structures are infinite-diaensional systems with very 
snail inherent energy dissipation (daaplng). Because of practical limitations, 
only finite-dinensional controllers and point actuators and sensors must be 
used for controlling lsrge space structures (LSS). In addition, considerable 
uncertainty exists in the knowledge of the paraaeters. For these reasons, the 
design of a stable controller for a large space structure (LSS) is a 
challenging problem. 

A class of controllers, terned “collocated controllers’* [1], represents an 
attractive controller because of its guaranteed stability properties in the 
presence of plant uncertainties. Collocated attitude (CA) controllers are 
designed to control the rigid-body attitude as well as the structural aodes, 
while collocated direct velocity feedback (CDVFB) controllers are designed only 
for enhancement of structural damping. Both types of collocated controllers 
guarantee stability regardless of the number of aodes in the LSS model and 
uncertainties in the knowledge of the paraaeters f 1 1 , [2]. A CA controller 
basically consists of compatible sensor/ actuator pairs placed at the same 
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locations, and utilizes negative definite feedback of position and velocity 
(e.gi, LSS attitude and attitude rate). A CDVFB controller (3] is a special 
case of the CA controller where only rate feedback is used for damping 
enhancement without affecting the rigid-body modes. It has been proved in 
references [ 1 ], [2], [3] that, the closed-loop system is always stable in the 
sense of Lyapunov, and is also asymptotically stable (AS) under certain 
additional conditions. 


Although collocated controllers have attractive stability properties with 
perfect (i.e., linear, instantaneous) sensors and actuators, the sensors and 
actuators available in practice tend to have nonlinear! ties and phase lags 
associated with them. In order to be useful in practical applications, the 
controller should be tolerant to nonlinearities (e.g., saturation, relays, 
deadzones, etc.), and to phase shifts (e.g., actuator dynamics and/or computa- 
tional delays). Uncertainties usually exist in the knowledge of the nonlinear- 
ities and the phase lags. For these reasons, this paper investigates the 
closed-loop stability of collocated controllers in the presence of unmodeled 
sensor/ actuator dynamics and nonlinearities. The situation is mathematically 
described by Including an operator X in the feedback path. The actual input 
u't) *s given by: 

u(t) "^fu^t) ( 1 ) 

where is the ideal (desired) input, ie is a nonanticlpative, linear or 
nonlinear, time-varying or invariant operator. For CA controllers, it is 
proved that the closed-loop system is globally asymptotically stable if 

1 ) X is linear, time-invariant (LTI) and stable with a rational transfer 

matrix H(s) which satisfies certain frequency-domain conditions, or 

2) If X consists of time-invariant, strictly monotonlc increasing 
nonlinearities belonging to the [0, •) sector. (A function 6 (o) is said 
to belong to the [k h) sector if 6(0) * 0 and ko^ jC 06 ( 0 ) < ho^ for all 
a * 0 ). 


For CDVFB controllers, it is proved that global asymptotic stability is 
preserved when 


n K is a stable nonlinear dynamic operator and satisfies certain passivity 
conditions, or 

2) X is a 8 tab J- e LTI operator with phase within ±90* 

3) St consists of nor .Inear gains belonging to the ( 0 ,*) sector. 


These analytical 1 ults significantly enhance the stability and 
robustness properties of collocated controllers, and therefore increase their 
practical applicability. 
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PROBLEM FORMULATION 


The linearized equations of notion of a large flexible space structure 
(using torque actuators) are given by: 


Ax + Bx + Cx « £ rju, 

i-1 1 1 

(2) 

X “ <V 6 s» *s* q l* q 2* * * * 

(3) 

A - diag (1,1 ) 

s nq x nq 

(4) 

B - diag (0 3 x 3 , D) 

(5) 

C - diag (0 3 x 3 , A) 

(6) 

r i * x 3' 

(7) 

“i * <u xi’ V "«i )T 

(8) 


where + 8 , 0 g , denote the three rigid-body Euler angles, n^ is the 

nuaber of structural nodes, q t denotes the nodal anplitude of ith structural 
node (i - 1, 2, . . , n_), I # denotes the 3x3 nonent of inertia aatrlx, 
is the 3 x n_ node-slope natrix at the ith (3-axis) actuator location. 
It is assuned that a, 3- axis torque actuators are used. It x 1 denotes the £ 
x i identity natrix, and diag( ) denotes a block-diagonal aatrlx. D is a 
syxaetrlc positive definite or seal definite aatrlx which represents the 
inherent structural daaplng. Since soae daaping, no natter how snail, is 
always present, we assuae D > 0 throughout this paper. A is an n^ x n^ 
diagonal aatrlx of squared structural frequencies 

A - dlag («*, “ n q> < 9 > 

As 8 unlng that a, 3- axis attitude and rate sensors (e.g. , star trackers and rate 
gyros) are placed at the locations of the actuators, the aeasured 3-axis 
attitude y a £ and rate y r j at actuator location 1 (ignoring noise) are 
given by: 


y 

y 


m T X 

(10) 

fil 1 

• 

ri * r i* 

(11) 


denoting 


T t I, T 

u ■ lUj, Uj. • • » u B l 
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( 12 ) 



( 13 ) 


r T . rr T r T r T i 

1 i» 2* * * ’ m J 


T T 

al’ y a2* 

. . , y T ] T 
* am 

(14) 

T T 

rl 1 y r2’ 

T,T 

• • » 

rm 

(15) 


where u, y a , y r are 3m x 1 vectors, and I* is a 3m x (n q + 3) matrix. The 
control law for the collocated attitude controller is given by: 


u = u + 11 
c cp cr 


u * -G y 
Cp pa 


cr 


-G y 

v J x 


(16) 

(17) 

(18) 


where u c represents the command input, u cp and u cr represent command 
attitude and rate inputs, and Gp, G r are 3m x 3m feedback gain matrices. 

For CDVFB controllers, the rigid-body rates are removed from the feedback 
signal by subtracting attitude rates at two locations. Consequently, the model 
used for damping enhancement has the form: 


q + Dq + Aq ** *^u (19) 

where f consists of appropriate differences between the mode-slopes. The 
control law is given by: 

u m -Gy (20) 

c r 

where 

y r - *q (21) 

The control laws given above for C A and CDVFB controllers have very 
attractive robustness properties. It was shown in [1], [2] that, if D > 0, 
Gp ” G p T > 0, and G r “ G r T > 0, then the closed-system is 

asymptotically stable (AS) > The stability result holds regardless of the 
number of modes in the model, and regardless of inaccuracy in the knowledge of 
the parameters. In real life, however, nonl inear i ties and phase lags exist in 
the sensors and actuators, which invalidate these robust stability properties. 
The real problem then is to investigate the closed-loop stability for the case 
where the actual input is given by Eq. (1), where ?C is a nonar.ticlpative, 
linear or nonlinear, time-varying or invariant operator. The situation is 
shown in Figure 1. Our approach is to make use of input-output stability 
concepts and Lyapunov methods. Me assume throughout the paper that the problem 
is well-posed, and that a unique solution exists. We start by defining the 
terminology and the concepts, which are adopted from [4]. 
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MATHEMATICAL PRELIMINARIES 


Consider the linear vector space Ln 2 of real square-integrable 


n-vector functions of tiae t, dt fined as: 

L n " R + * R "|/o 8 T < t >8( t > dt < *} (22) 

where R n is the linear space of ordered n- tuples of real numbers , and R 

denotes the interval 0 < t < •. The scalar product is defined as 

<«! . E 2 > " 8^ e) 82 (t)dt (23) 

For g^Lf, 2 , its nora is defined as 

■g' - <g,g> 1/2 (24) 

Define the truncation operator P T such that 

. Ig(t) 0<t<T 

g T (t) - P-g(t) - (25) 

1 1 0 t > T 

Define the extended space L^ 2 : 

L ne " f g: IR + +R " | % €L n^ T ^ °1 (26) 


Thus Lj, e 2 is a linear vector space of functions of t whose truncations are 
square-integrable on [0,T) for all T < •. For gj , g 2 e L,j e 2 , define 
the truncated inner product 

<g l » *2^ " ^lT’ *2^ " ^ *J< t >« 2 < t > dt <27) 

° 1/2 

The truncated nora is defined by: Iglj ■ <g»g >7 • 

Consider an operator fa : Lne^-^Lne 2 • 1® said to ** strictly 

passive if there exist finite constants $ and 6 > 0 such that 

< fag, g> T > 3 + filg* 2 V T>0, Vg £ L n 2 (28) 

fa. is passive if 6 ■ 0 in (28) . 
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ROBUSTNESS OF COLLOCATED ATTITUDE CONTROLLERS 


Stability With Dynamic Operator in the Loop 

We consider the case where the operator is linear and tine-invariant 
(LT1), and has a finite-dinensional state-space representation. We denote 
by ^C(z 0 ; g) where z Q is the initial state vector of ff , and assume m * 1 
for simplicity (i.e., one 3-axis actuator). 

Theorem 1 . Suppose ft is a non-anticipative, strictly stable, completely 
observable, LTI operator whose transfer matrix is H(s) - el+H(s), where e > 0 
and H(s) is a proper, minimum-phase, rational matrix. Under these conditions, 
the closed-loop system given by Eqs. (1), (2), (10), (11), ( 16)— ( 18) is 

asymptotically stable (AS) if 


H(jw) ( <dg^ - jGp) + («*C r + jGp) H*(j«) 0 for all real w. 

where * denotes the conjugate transpose. 

Proof - Define the function 


(29) 


T *T • 

V(t) » x Cx + x A x 


(30) 


Since C > 0, A > 0, V(t) 0 for all t ^ 0. Differentiating V with respect to 
t, and using (1), (10), (11), (16)-(18), 


V = - 2x T Bx - 2u^ G f l /£[z q ; u c J (31) 

whereof also depends on its initial state z Q . Since ft is linear, 

# lz o ; U C ] “ h 0 (t) + ^ t0; “c 1 (32) 

where h 0 (t) is the unforced response of due to nonzero initial state. 

Since iC i® strictly stable, > hi 0 is finite for any finite z Q . 

Substituting (32) in (31) and integrating from 0 to T, since V(T) 0, 

-1 

0 < V(T) - V(0) - 2<x, Bx> t - 2 <u cr , G r h Q > T 


where 


- 2<u , G -1 if u >_ 

cr r ^3»p cp T 


# p u cp “ < G P + u cp ] 


cp 


(33) 


(34) 
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m ♦ 

In (34) , "s' denotes the derivative operator. Cfe* is technically noncausal; 
however, this difficulty can be overcome ty defining the derivative of a trun- 
cation at T to be equal to that of the untruncated function.) Using Parseval's 
theorem. 


an 

<V °t 1 #p u cp > T ' TT 1 °cr «“> °r‘ »«“> <° p ♦ 


a y 

/ 0. * <J“) G" 1 H(jco) [-&- + Gl D (j“)dw 


2* 


cr ^ r 
T 


J« r J cr- 


oc Q 

■ “ST" f U cr <Ja,) ^r 1 + Gj.) 

00 T 

G * . 

+ (-^s~ + G ) H (ja>) G" 1 } 0 (jaj)dai 
jw r r cr 


The matrix in the brackets is positive (from Eq. 29), and we have 

<u , G * u >_ > c lu I 2 

cr* r /^-p cp T — cr T 

which yields (from (33) 


(35) 


0 < V(o) -2 <q, Dq> T -2 e 


»u ij -2 
cr T 


-1 


<u 


cr* 


h >_ 

o T 


(36) 


wherein we have used the fact that x^Bx « q Dq. Therefore, 

X m (D) lqt T + elu cr , T — v(o>/2 + '“crV^W (37) 

where I l g denotes the spectral norm of a matrix, and ^ denotes the 
smallest eigenvalue. Eq. (37) can be written as 


V» 'i'i * ‘Vcr'x 



) 2 < V(o)/2 + c^ACj 


(38) 


where cj ■ y/t and C2 ■ 
(t) ■ 0. Denoting the 

implies that lima (t) ■ 0. 
t-"». 


t-**> 


Aq 


1 h 0 1 Therefore, lim q (t) - 0, and lim 
rigid-body attitude t ’*” o - ($ 8 , 9 g> a )T f 

Taking the limit of the closed-loop equation 



this 

as 


(39) 
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where the overhead bar denotes the limit as t-**. From (39), ^?u * 0 and 

q = 0, which yields S’ * 0. Since St is observable and its output tends to 
zero, its state vector tends to zero as t-*», and the 'system is asymptotically 
stable. 

The followiug corollary essentially states that, for diagonal G_, G r , 
and H, it is sufficient that the phase lag of H(jou) is less than the phase lead 
introduced by the controller. 

Corollary 1.1 . Suppose G p , G r and H are diagonal and satisfy the 
assumptions of Theorem 1. Then the closed-loop system is globally 
asymptotically stable if 

. wG » wG 

-tan' 1 -—E± < Arg {h.(»} < 180° -tan' 1 for all real w (40) 

pi pi 

where Arg( ) denotes the phase angle of a complex variable. 

For the case where Hu (s) * kj/(s + a*), with kj, a* > 0, 
condition (40) becomes 

> l/. t (41) 

pi 

Thus, for the case of first-order sensor/actuator dynamics, the system is 
asymptotically stable if the ratio of rate-to-proportional gain is at least 
equal to the magnitude of the actuator pole. 

In Theorem 1 and Corollary 1.1, the transfer function of was assumed to 
be of the form: H(s) * el + H(s), where e > 0. That is, a direct transmission 
term, no matter how small, was present. From Theorem 1, the closed-loop system 
is AS for any e > 0. Therefore, the closed-loop eigenvalues are all in the 
open left half-plane (0LHP). Because of continuity, ic is obvious that, when 
e « 0, the eigenvalues will not cross the imaginary axis* That is, the eigen- 
values viii be in the closed left half-plane (CLHP). Theorem 2 given below 
considers the case when e - 0. It essentially shows that, if the closed-loop 
system with no elastic modes is AS with S& in the loop, then so is the system 
with elastic modes, provided that (29) is satisfied with H replacing H. 

Theorem 2 . Suppose X is a non-anticipative, strictly stable, completely 
observable, LTI operator with rational transfer matrix H(a) which is proper and 
minimum-phase. If the closed-loop system for the rigid body model alone (i.e., 
Eqs. '1), (2), (1C), (11), (16)— ( 18) with n q - 0) is AS, then the entire 

clo8v loop system (i.e., with n q * 0) is AS provided that 

H(joi) (wG r - JG p ) + (<oG r + jG p ) H* (jw) 2 0 for 811 real w (42) 

Proof . Considering the rigid-body equations, 

I s a - #u c - (u a + u q ) (43) 
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where Uo - ” GpO - G r o and tt- ■ -Gpfq-G r 4q. Thus the transfer 

function fron q to a Is given by 

M(s) - ll + H(s) {G p ♦ G^lr 1 H(s) {G p + G r s} * 

Since the closed-loop rigid-body systea Is strictly stable by assumption, M(s) 
is strictly stable and finite-gain, which inplles 

»“* T < Y*ql T + »h m l T (44) 

where t is the gain of M and h^ is its free response. Proceeding as In the 
proof of Theorem 1, we can arrive at Eq. (37) wherein e ■ 0 and n, it, 
replaced by Since u^ - -G r (o + *q) , we have fron (44) , 

'V't < VA + c 2 'h.' T <«) 

where cj and C£ are positive constants. Completing squares as in (38) and 
noting that lh.1 is finite, it can be proved that Iqlf is bounded for all 
T > 0, and that li m JktW). Proa (45), u CT also tends to zero as t+**. The 
remainder of the proof is similar to that of Theorem 1. 

Corollary 2.2 With the saae assumptions as in Theorem 2, if Gp, Gr, and H 
are diagonal, then the closed-loop system is AS if (40) is satisfied with H 
replacing H. 

Proa Corollary 2.2, for the case where Hu(s) ■ kj/(s + a^) with 
kj_, a^ > 0, the closed-loop asymptotic stability is assured if G p £ < 
a i G ri for i - l t 2, . . , a. ~ 

The significance of the results of this section is that the stability can 
be assured by making the ratio of the rate-to-proportlonal gains sufficiently 
large. One has to know only the sensor/actuator characteristics, and the 
knowledge of the plant parameters is not required . This result is completely 
consistent with the result obtained In 1 5J Tor single-input, single- output 
systems, for snail Gp and Gr, using a root-locus argument. 

The next section considers the case where nonllnearlties are present in 
the loop. 


Stability la the Presence of Nonllnearlties 
Suppose Bq. (1) is replaced by 

u ■ iKu ) (46) 

c 

where ♦ is an n-vector, one-to-one, tiae-invarlant function, p: SF+I*, as 

follows: 
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<Ko) - +2 (o^), . . , 


+ (0 _)] 


(47) 


Por this case, the stability of the closed-loop system can be investigated 
using Lyapunov methods. A function 0(v): r1-*-r1 is said to belong to the 
(0, 00 ) sector if 0(0) * 0 and v0(v) > 0 for v * 0. 0 is said to belong to the 
[0, 00 ) sector if 0(0) - 0 and v0(v) 0 for v * 0. [Fig. 2] Many nonlineari- 

ties encountered in practice, such as saturation, relay, dead-zones, belong to 
the [0, 00 ) sector. As in the previous section, we assume that the problem is 
well-posed, and that a unique solution exists, and we consider the case with 
one 3-axis actuator for simplicity. 

Theorem 3 . Consider the closed-loop system given by Eqs. (2), (10), (11), 
( 16)— ( 18) , and (46), where G p and G r are positive definite and diagonal, 
and each is in the (0, ®) sector and is strictly monotonic increasing for 
i - 1, 2, • . . . , m. Then the closed-loop system is globally asymptotically 
stable. 


Proof . Define 

3 

V(x, x) - x T Cx + x T Ax + 2 £ G j / Cpi ^ (v)dv (48) 

i-1 p o 1 

where G p i and u cp ^ denote the iith and ith elements of G p and 

u cp , respectively. This form is the well-known "Lure'-type" Liapunov 
function [6]. Prom Eqs. (4) and (6), x^Cx + x^Ax - 0 only when u • 0, 
q * q - 0. That is, this quantity can be zero when o * 0. However, when q - 
0, u cpx - G pi a, which is nonzero when a * 0. Thus the third term on the 

right hand side of (48) is positive (since is in the (0, •) sector) for a 
* 0. Therefore, V is positive definite. From (48), using (2), (46), 

( lb)-( 18) , 


5 - - 2 j i '‘ crl «rt V%1 • \ rl > - ^V-cpApi 


(49) 


* “1 

Since u . ■ G , G . u . , we have from (49): 
cpi pi ri cri 


V - -2x T Bx - 2 t « cr l G rt [ *l ( “cpl + “cri 5 • ♦i (u cpl )1 


i-1 


cpi 


(50) 


Since la strictly monotonlc increasing, 

V < -2q T Dq (51) 

V-0 only when q-0 and u cr ^ - 0, which implies a - 0. Considering the 
closed-loop equation, 


0 

Aql 


I 

*T| 


<Ku 


cp) 


(52) 
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whicn yields ^ (u cpi ) “ 0 and # q - 0. Since ^(V) - 0 only at v • 0, 

this implies that a - 0. Thus V = 0 only at the origin, and the system is 
globally asymptotically stable. 

Thus the collocated controller is guaranteed to be globally asymptotically 
stable in the presence of monotonic increasing nonlinearities. This »* ■ 

of the nonlinearities is also called "incremental passivity." As ae j n t.i 
previous section, if the nonlinearities are replaced by dynamic operat s, mere 
incremental passivity is not sufficient for stability. 


ROBUSTNESS OF VELOCITY FEEDBACK CONTROLLERS 


Stability with Dynamic Operator in the Loop 

Consider the case where a nonlinear dynamic operator X?(z 0 ; v) is 
present in the loop. Suppose is represented by the following state-space 
model : 


z =* f(z, V, t) , z(n) » z 

o 


(53) 


w(t) « p(z, t) (54) 

where v and w are 3m x 1 vectors which are the input and the output of jc. 
Define the operator 

( z 0 » 8) “ <#( z o ; 8) - <^(* o ; 0) (55) 

We define to be internally stable if I (*oi 0)1 18 finite for any 

finite z Q . 

Theorem 4 . Consider the system given by Eqs. (1), (19), (20) (21), where the 
operator /c has the state-space representation given by ; J3) , (54) . Suppose 
is passive and is uniformly observable, finite-gain, internally 

stable, continous operator. Then the closed-loop system is globally 
asymptotically stable. 

Proof. Defining 


V(t) ■ q T Aq + q T q (56) 

V(t) > 0 for all t > 0. Differentiating V(t) with respect to t and using Eqs. 
(19), (20), (21) and (1), 

V - “2q T Dq - 2u^G^ #(z q ; u^) (57) 

Integrating from 0 to T , since V ^ 0, 
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( 58 ) 


0 < V(T) - V(0) - 2<q, Dq> T - 2<u cr , g" 1 #( z q \ u cr » T 
which yields (after manipulation) 

2 V D) '1'? 1 V(0) " 0 + 2, q , T , * , 8 l /£(*<,; °>' (59) 

where 0 is a constant (see Eq. 28)* 

By using a procedure similar to that in the proof of Theorem 1, it can ba 
proved that #ql is bounded , and that the system is globally asymptotically 
stable. 

The following corollary is an immediate consequence of Theorem 3. 

Corollary 4.1 » If & is a strictly stable, completely observable, LTI operator 
with rational, mi mi mum- phase transfer matrix H(s), the closed-loop system of 
Eqs. (1), (19), (20), (21) is asymptotically stable provided that 

* 

H(jw)G r + G^H (joj) 0 for all real w (60) 

Note that the above condition is equivalent to passivity of G r ~*^tf • 

Corollary 4.2 . Under the assumptions as in Corollary 1*1, if G r and 
are diagonal, the closed-loop system of Eqs. (1), (19), (20), (21) is 
asymptotically stable if 

RelH^jw)] > 0 for all real w 

As a result of Corollary 4.2, CDVFB controllers can tolerate stable 
first-order dynamics in the loop . If H^s) ■ e“ J ^ , we have 

Re[H i (j(u)J > 0 fo^ -90° < < 90*; tnerefore, CDVFL controllers have 90° 

phase margin. 


Stability in the Presence of Nonlinearities 

Suppose the operator XT in (1) is replaced by an m-vector nonlinear 
function ^ as in Eq. (47), except that ♦ is allowed to be tine-varying . The 
following theorem gives sufficient conditions for global asymptotic stability. 

Theorem 5 . Consider the closed-loop system given by Eqs. (i), (19), (20), 
(21), where G r is diagonal and positive definite, and each ^ belongs to 
the [0, ~) sector. Then the closed-loop system is globally asymptotically 
stable. 
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Proof. Starting with V as in Eq. (56), 


* - -2q T Dq - 2 t) (62) 

• • « 

Thus V < 0 f and V 5 0 only If q =0, which can happen (from the equations of 
motion) only when q = 0. Therefore* the system is globally asymptotically 
stable. 

The next theorem considers a special case when nonlinearities and 
first-order dynamics are simultaneously present In the loop* as shown in Fig. 
3. 

Theorem 6 . Consider the closed-loop system given by Eqs^, (1)* (19)* (20)* 
(21)* where G r > 0 is diagonal. Suppose fC m diag ( • - • • 

• • m)* where 

<#i g "*i ( ^i g) <63) 

where each 4*1 : is a time-invariant, differentiable function 

belonging to the [0, 00 ) sector, and there exists a constant K < • such that 
1*1/ < K over the interval (-*, “). Suppose is an LTI operator whose 

transfer function is: Gj(s) * ajO + p^s) - ^, ®i > 0, p^ > 0 for i « 

1, 2, . . ,a. Then the system is globally asymptotically stable. 

Proof . Starting with V as in Eq. (56) and proceeding as in the proof of 
Theorem 4, we have 

3m 

0 < HO) - 2<q, Dq> -2 l <u , i> ± { ^(0; u ) + g oJ }> (64) 

i*l 

where is the unforced response of due to nonzero initial state. 

Using <aean value theorem, Eq. (64) can be written as: 

. . 3m 

0 < V(0) - 2<q, Dq> T - 2 I <u cri> ^(0; u cri >}> T 

+ <u cri* ♦i <o) «oi > T (65) 

where u lies in the Interval bounded by - J^(0; Ug- 4 ) and 1^(0; u cr j) 
+ g ol . Noting that the operator ^ { Y'iCO; u crl )j i* passive {4), and 

simplifying, we have 


A B (D)lql£ < V(0)/2 + »*l B Klql T lg 0 l 
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where 


3a 

•g » - I Ki* < * 

0 i«l ci 


The remainder of the proof is similar to that of Theorea 4. 


( 66 ) 


CONCLUDING REMARKS 


Robustness properties were investigated for tw~ types of controllers for 
large space structures, which use collocated sensors and actuators. The first 
type is the collocated attitude (CA) controller, which controls the rigid-body 
attitude and the elastic motion using negative, definite feedback of aeasured- 
attitude and rate. The second type of controller is th ; collocated direct 
velocity feedback (CDVFB) controller for damping enhancement. Such controllers 
are known to provide closed-loop asymptotic stability regardless of the number 
of aodes and parameter values, provided that the actuators and sensors are 
perfect. This robust stability property was extended further in this paper by 
proving that the global asymptotic stability is preserved even when sensors/ 
actuators are not perfect. The CA controller preserves global asymptotic 
stablity when the sensors/actuators are represented by (1) linear, time- 
invariant dynamics which satisfy certain simple phase conditions, or (li) 
time-invariant, monotonic Increasing nonl Inearl ties belonging to the (0, *») 
sector. The CDVFB controller preserves global asymptotic stability inder much 
weaker conditions. In particular, CDVFB controllers have 90° phase margin and 
are tolerant to time-varying nonlinearities in the [0, •) sector. These global 
asymptotic stability results are valid regardless of the number of modes in the 
model and regardless of parameter values. Therefore, it can be concluded that 
these controllers offer viable methods for robust attitude control or damping 
enhancement, especially when the parameters are not accurately known. An 
important application of the collocated attitude controller would be during 
deployment or assembly of a large space structure, when the dynamic character- 
istics are changing, and during initial operating phase, when the dynamic 
characteristics are not known accurately. A robust collocated controller can 
provide stable interim control which can perhaps be replaced later by 'a 
high-performance controller designed using parameters estimated on orbit. 
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ABSTRACT 

Important progress In research and application of Adaptive Control 
Systems has been achieved in the last ten. years. The techniques which 
are currently used in applications will be reviewed. Theoretical aspects 
currently under investigation and which are related to the application of 
adaptive control techniques in various fields will be briefly discussed. 
Applications in various areas will be briefly reviewed. The use of adaptive 
techniques for vibrations aonltoring and active vibration control will 
be emphasized. 

I. INTRODUCTION 

The use of adaptive control techniques is motivated by the need of 
automatically adjusting the parameters of the controller when plant para- 
meters and disturbances are unknown or change with time, in order to achieve 
(or to maintain) a certain index of performance for the controlled system. 
While cals problem can be reformulated as a nonlinear stochastic control 
problem (the unknown parameters are considered as auxiliary states) the 
resulting solutions are extremely complicated. Therefore, in order to obtain 
something useful, it is necessary to make' approximations. Adaptive control 
techniques can be viewed as approximations for nonlinear stochastic control 
prohlems. Model Reference Adaptive Controllers (MRAC) and Self-Tuning 
Regulators (STR) can be considered as two approximations among other possible 
approximations. These two approaches to adaptive concrol problems have 
been t ct^nsively studied and they are well understood. These approaches 
have bfc.n proven to be usable in practice and an Important number of success- 
ful .^plications have been reported. However, some important theoretical 
r >biems still need further investigation and more experience utilizing 
these techniques in practice should be gained. 

As mentioned earlier the MRAC and STR approaches can be considered 
as possible approximations for the solutions of some nonlinear stochastic 
control problems. However, when making approximations, some hypothesis 
should be considered which can Justify these approximations. The basic 
hypothesis for MRAC and STR is of an algebraic nature: for any possible 
values of the plant (and disturbance) parameters, there exists a linear 
ontroller with a fixed complexity such that the plant plus the controller 
has the pre-specified characteristics. The adaptive control loop 
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will only search for the values of the tuned parameters of a controller 
whose structure has been fixed using a standard control design technique. 

The MRAC and STR techniques have been initially developed independently. 
Subsequently , connections between these two techniques have been investigated 
and emphasized. See Egardt (1980), Landau (1981), Landau (1982), Astrom 
(1983). For certain classes of problems these two approaches are equivalent. 
It is Important to note that the development of these two adaptive control 
techniques is largely based on the deep understanding of certain types of 
linear algebraic control design techniques and of an appropriate interpre- 
tation of the controller design strategy. 

A brief review of the underlying concepts and configurations used for 
MRAC and STR is given in Section II. The linear tracking and regulation 
problem is reviewed in Section III and this allows the definition of the 
structure of the controller. The structures of various adaptive control 
schemes are presented in Section IV. The parameter adaptation algorithms 
are discussed In Section V. Applications- are listed in Section VI. Current 
research trends are indicated in Section VII. 

II. MODEL REFERENCE ADAPTIVE CONTROLLERS AND SELF-TUNING REGULATORS 

- BASIC PRINCIPLES. 

Figure 2.1 illustrates the basic philosophy for designing a linear 
controller. The desired performance is specified in terms of the character- 
istics of a dynamic system which is a "realization" of the desired input- 
output behavior of the closed loop control system. The controller is 
designed such that the closed loop control system is characterized by the 
same parameters as those of the "desired" dynamic system. 

Since desired performance corresponds in fact to the output of the 
"desired" dynamic system which is p re-specified, the design problem can 
be recast as in Fig. 2.2. The objective is now to design a controller 
such that the error between the output of the plant and the output of the 
reference model (the dynamic system which has the desired characteristics) 
is identically null for identical initial conditions and such that an 
eventual initial error will vanish with a certain dynamics. 

These two interpretations of the linear control design in the case 
of a plant with unknown or varying parameters lead to two adaptive control 
schemes, shown in Figs. 2.3 and 2.4. Figure 2.3 is a a extension of the 
scheme given in Fig. 2.2 and is called (explicit) MRAC. The difference 
between the output of the plant and the output of the reference model is 
a measure of the difference between the real performance and the desired 
one. This information is used through an "adaptation -mechanism" (parameter 
adaptation algorithm) to directly adjust the parameters of the controller. 
This is a "direct" adaptive control scheme. 

Figure 2.3 is an extension of the scheme considered in Fig. 2.1 in 
the sense that a suitable controller can be designed if a plant model 
is estimated on-line based on the current input-output data available. 

This scheme is called STR and it ia Inspired by the separation theorem 
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A. Minimum Phase Plants 


Consider the S.I.S.O. discrete linear time invariant plant described 
by: 

a) deterministic environment: 

A(q 1 )y(k+D) - B(q _1 )u(k), d > 0, y(0) + 0 (3.1) 

b) stochastic environment: 

A(q" 1 )y(k+d) - B(q _1 )u(k) + C(q _1 )a)(k+d) (3.2) 

where: 

A(q 1 ) ■ 1 + a.q 1 + ... + a q n 
i n 

B(q _1 ) - b +b.q _1 + ... + b q~ m + 0 (3.3) 

o i m o 

C(q ^)- m 1 + c^q + . .. + c^q n 

^(q" 1 ) y(k+l) - 0 (3.4) 

where: 

C R (q' 1 ) - 1 + cjq" 1 + ... + c^q _n (3.5) 

is an asymptotically stable polynomial. 

In order to design the controller, we will consider two strategies, 
one using an explicit reference model as part of the control system and 
the other using a 1-step ahead predictor of the plant output which together 
with the controller will form an implicit reference model. 

Strategy 1 : Explicit Reference Model 

One considers an explicit reference model given by: 

C T (q 1 ) y M (k+l) ■ D(q 1 ) u M (k) (3.6) 

M 

where y (k) is the output of the explicit reference model. The design 
objective is: 

C R (q _1 ) e(k+l) -0 k> 0 (3.7) 


where 


e(k) - y(k) - y M (k) 


(3.8) 
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is the plant model error. It is obvious that Eq. (3.7) includes the 
regulation objective specified by Eq. (3.4) (for umOO = 0, e(k) » y(k)). 
Equation (3.1) with d » 1 can be rewritten as: 

C R (q _1 ) y(k+l) - [C R (q" 1 )-A(q" 1 )]yk+l) + B(q' 1 )u(k) 

■ R(q 1 )y(k) + b Q u(k) + B*(q _1 )u(k) (3.9) 

where 

n 

R(q _1 ) • C R (q 1 ) - A(q -1 ) - J (c^-a^q -1 * 1 - rj+^q -1 . . (3.10) 

B*(q" 1 ) - B(q' 1 ) - b Q (3.11) 

and Eq* (3*7) becomes: 

C R (q' 1 )e(k+l) - R(q _1 )y(k) + bu(k) + B*(q -1 )u(k) - C R (q" 1 )y M (k+l) - 0 (3.12) 
which yields the desired control 


C_(q 1 )y M (k+l) - R(q X )y(k) - B (q X ) ,k) 

u(k) - — 

b 

o 

Introducing the notation: 

♦o(k) ■ [u(k-l) ... u(k-m), y(k) ... y(k-n+l)] 

6 0 “ Ib l **• b m ’ r l **• 'n 1 
Equation (3.15) can be written: 

c R (q ' 1) y M( k+ 1) - eJ* Q (k) 

u(k) g 

o 

or in an equivalent form: 

C R (q _1 )y M (k+l) - 6 T «Kk) 
where: 

4>(k) T - [u(k) , ^(k) ] 

e T “ I b o* # 

The resulting control scheme is given in Fig. 3.1. 


(3.13) 


(3.14) 

(3.15) 


(3.16) 

(3.17) 

(3.18) 

(3.19) 
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Figure 3.1 
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Strategy 2 : Implicit Reference Model. 

This strategy is directly inspired by the separation theorem: one 

first designs an appropriate predictor for the plant output, and then a 
control will be computed such that the output of the predictor behaves as 
the desired output in tracking. 

First step : (predictor design). The predictor will be designed such that 

the 1-step ahead prediction error E(k+1) is defined by: 

e(k+l) - y(k+l) - y(k+l) (3.20) 

where y(k+l) is the predictor output and will vanish according to: 

c R ( q 1 ) £(k+l) - 0 ; k > 0 (3.21) 

Using Eq. (3.9) • one obtains from Eq. (3.21) that the l-step ahead predictor 
is characterized by: 

C R (q' 1 )y(k+l) - bu(k) + R(q _1 )y(k) + b*(q _1 )u(k) - 0 T *(k) (3.22) 

where R(q 1 ), B*(q *), 0, $(k) are given by Eqs. (3.10), (3.11), (3.18), 
and (3.19) respectively. 

Second step: (computation of the control). The control is computed such 

that y(k+l) ■ y**(k+l); where y**(k+l) is the desired output given by Eq. (3.6). 
One finally obtains: 

C R (q" 1 )y(k+l) - C R (q _1 )y M (k+l) - 0 T *(k) (3.23) 

and the control is given by Eq. (3.17) as expected. 

Because of the output of the predictor is equal to the output of the 
explicit reference model, the predictor plus the control will form an 
"implicit reference model." 

B. Tracking and Regulation in Stochastic Environment 

We will examine first the behavior of the controller designed in the 
previous section when the plant is subject to a stochastic disturbance of 
the type considered in Eq. (3.2). For d ■ 1 Eq. (3.2) becomes: 

A(q *■) y(k+l) ■ B(q *") u(k) + C(q ^) io(k+l) (3.24) 

Using the control given in Eq. (3.13) one obtains: 

C R ( q‘ 1 ) y(k+l) - C R (q _1 ) y M (k+l) + C(q _1 ) u»(k+l) (3.25) 
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Neglecting the effect of the deterministic disturbance (which vanishes 
with the dynamics defined by (^(q-l)) one can re-write Eq. (3.25) as: 


yflc+l) . y M (k+l) + gfaT . j „(k+l) 

. u H (k) + Ciai_ „ (k+1) 

C T (q > V 1 ) 


(3.26) 


Equation (3.26) shows the presence of two reference 
deterministic one for tracking by D ^-l^ whose input is 


My 


^(q" 1 ) 


signal u (k) and a stochastic one for regulation defined 
input is the white noise sequence <*>(k+l) . 


models : a 

the reference 


by 


whose 

Vq” 1 ) 


In general the objective of the design in a stochastic environment is 
to obtain a minimum variance tracking and regulation, l.e.: 


E { [y(k+l) - y M (k+l)] 2 } - min 


(3.27) 


From Eq. (3.26) it results straightforwardly that the objective of 
Eq. (3.27) is achieved if one chooses: 


C R (ql) “ C(ql) 


(3.28) 


which leads to: 

E { [y(k+l) - y M (k+l)] 2 } - E{u> 2 (k+1)} - o 2 


(3.29) 


For the case d > 1, the control can no longer be computed directly 
using the strategies given above since this will lead to a non-causal 
controller (future values of the output and input are Involved for the 
computation of the control at the Instant k). This problem can be avoided 
hy using a polynomial identify which allows us always to express the output 
y(k+d) in terms only of y(k); y(k-l)... and u(k), u(k-l) ... 

Consider the following polynomial identity: 


C R (q _1 ) - A(q -1 ) S(q _1 ) + q“ d R(q" 1 ) 


(3.30) 


which has a unique solu' 'on for the polynomials S(q 2 ) and R(q 
deg S(q“*) ■ d-1 where 


) for 
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(3.31) 


Stq" 1 ) - 1 + S^" 1 ... S d _ iq " d+1 

R(q _1 ) - r 1 +r 2 q" 1 ... r n q _u+1 (3.32) 

Using the identity of Eq. (3*30) in Eq. (3.9) for d > 1 one obtains: 

C R (q _1 )y(k+d) - R(q _1 )y(k) + bu(k) + B s (q _1 )u(k) (3.33) 

where 


B s (q _1 ) - B(q" 1 )S(q” 1 ) - b Q (3.34) 

Equation (3.7) for d > 1 becomes: 

C R (q~^)e(k+d) - R(q' 1 )y(k) + bu(k) + B^q” 1 ) u(k) 

- C R (q -1 ) y M (k+d) - 0 (3.35) 

which yields the desired control 

c R (q X ) y M (k+d) - R(q _1 )y(k) - B s (q _1 )u(k) 

u(k) g 

o 

c_<q’ 1 ) y M (Ictd) - e* * (k) 

- — r 2 S— (3.36) 


The control has the same structure as for the case d • 1 except that the 
polynomials R(q~^) and B (q - *) are different, as well as 0 O and t 0 (k): 

Note that the strategy presented above achieves a poles- 
zeros placement. 

C. Non-minimum Phase Plants 

In this case one can no longer assume that B(z ^) is asymptotically 
stable and therefore the zeros of the plant transfer function can no 
longer be cancelled. The basic control strategy (algebraic approach) is 
the poles placement technique without zeros cancelling. The basic relation 
for the design of the controller is the Bezout identity: 

A(q" 1 ) S(q* 1 ) + q _d B(q" 1 ) R(q* 1 ) - (q _1 ) 
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and the controller has the structure: 


S(q *) u(k) - j C R (q 1 ) y M (k+d) - R(q _1 ) y(k) 


0 


1 if B(l) - 0 
B(l) elsewhere 


For a survey of the control strategies for non-minimum phase plants, see 
Landau, M'Saad, Ortega (l r °l). 

IV. STRUCTURES OF ADAPTIVE CONTROL SYSTEMS 


In adaptive control schemes the fixed controller designed for the case 
of known parameters is replaced by an adjustable controller having the same 
‘ructure, i.e., the fixed parameter vector will be replaced by an adjustable 
parameter vector which for the case of the design considered for minimum 
phase plants is given by: 

0 T (k) - [b (k), 0*(k)] (4.1) 

o u 

and the corresponding control law will be given (either in deterministic or 
stochastic environment) by: 

C R (q _1 )y M (k+l) - 6*(k)A (k) 

u(k)--5 2 (4.2) 

b o (k) 

or: 

0 T (k) *(k) - (^(q _1 ) y M (k+l) / 4.3) 


See Fig. 4.1a. 

Note that in the case of schemes using an implicit (prediction) 
reference model (STR) the plant predictor will be replaced by an adaptive 
predictor governed by: 

C 2 (q _1 ) y(k+l) - 0 T (k) *(k) (4.4) 


and the control will be computed according to the strategy in the linear 
case with known parameters which will lead to Eq. (4.3). See Fig. 4.1b. 

V. PARAMETER ADAPTATION ALGORITHMS 

Various approaches have been considered for the development of parameter 
adaptation algorithms (PAA). A fairly general structure for the PAA is 
given by: 


0(k+l) - 0(k) + F R <Kk) v(k+l) 


(5.1) 
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Ill 



















v(k+l) 


(5.2) 


1 + 4(k) T F k ♦(k) 

F k+i “ x i (k) h 1 + x 2 (k) * (V) ♦ (k)T 
0 < A 1 (k) < 1 ; 0 < X 2 (k) < 2 ; F Q > 0 


(5.3) 

(5.4) 


Using the matrix inversion lemma: 

i F k * (k> 

k+1 A (k) lf k A (k) _ 

A^lO + * (k> F k *< k > 

* 

where P;k) is the adjustable parameter vector, F(k) is the matrix adaptation 
gain, 400 is the measurement or the observation vector and v°(lrt-l) and 
v(k+l) are the "a priori" and the "a posteriori" adaptation errors respec- 
tively. The"a priori" adaptation error is a measurable quantity which 
depends on 6(i) up to the instant k, and the "a posteriori" adaptation error 
which enters in the adaptation algorithm is not directly measurable (it 
depends on 6(kfl)) but can be expressed in terms of the "a priori" adaptation 
error as indicated in eq. (5.2). 

Different choices for Xi(k) and A 2 (k) are possible leading to different 
types of variations of the adaptation gains. The performances of the 
adaptive control systems in various situations depend upon the choices of 
these two parameters. For details see Landau, Lozano (1981) and Landau (1983). 


♦ <W T F fc 


VI. APPLICATIONS 

There are already a significant number of applications of adaptive 
control systems as well as a few commercial products. For references, see 
Astrom (1983), Landau (1981), Landau, Tomizuka, Auslander (1983), Narendra, 
Monopoly (1980), Unbehauen (1980). 

The adaptive control schemes can be used in three modes of operation: 

1) Auto-tuning of a linear controller in the case of plants with unknown 
but constant parameters. 

2) Building a gain schedule for unknown plants with dynamics depending 
on operating points. 

3) Adapting in real-time the controller for plants with unknown and time- 
varying parameters. 
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An important remark to be made is that adaptive control algorithms 
cannot be used in p actice without a priori analysis of the control problem 
corresponding to each tentative application. This analysis should give 
answers to two categories of questions regarding (a) the need of adaptive 
control and (b) specific design requirements. 

The main areas of applications are: 

- Grinding 

- Drying furnaces 

- Cement mills 

- Chemical reactors 

- Distillation columns 

- Diesel and explosion engines 

- Heating and ventilation 

- Paper machines 

- Power systems 

- Electrical drives 

- Autopilots for ships 

- Robotics 

- Heat exchangers 

- pH-control 

- Active vibration control 

An adaptive active vibrations control is described in Mote, Rahimi 
(1983). It uses first a recursive parameter estimation technique for 
estimating in real-time the parametric model of the composite vibration 
signal for circular plates (the vibrations frequencies). Then the parameters 
of the transfer from control heat to vibration frequency are estimated on- 
line and used for computing in real time the controller parameters. 

VII. THEORY 

The most complete theory is available today for the adaptive control 
of minimum phase plants achieving a poles-zeros placement. For this type 
of plant, tracking and regulation with Independent objectives can be 
achieved both in deterministic and stochastic environments. Both MRAC 
and STR approaches lead in this case to "direct" adaptive control schemes. 

The besic assumptions for the design of adaptive control systems for 
minimum-phase plants in deterministic and stochastic environments are 
summarized next. 

- Exact knowledge of the plant delay (d) . 

- Knowledge of an upper bound for the degree of A(q ) which is the 
denominator of the plant transfer function. 

- The zeros of the plant transfer function must lie within the unit circle. 

- A lower bound of the magnitude of the leading coefficient of the plant 
transfer function should be known. 

- The sign of the leading coefficient of the numerator plant transfer 
function is useful to be known (in order to avoid large adaptation 
transients) . 

- The stochastic disturbances are modeled by ARMA processes. 

- Asymptotic type convergence is considered. 
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However, in practice some of these assumptions cannot be reasonably 
satisfied, in particular, the need for knowing an upp'r bound for the 
denominator degree (which in many cases simply does net exist) and the 
requirement that the disturbance is of ARMA type. 

The use of reduced order models in adaptive control design is one of 
the main research topics today, and interesting results have been obfalnec 
leading to improved design techniques. See Ioannou (1983), Ortega, Ianuau 
(1983), Kosut (1983). 

The case of disturbances which cannot be modeled by ARMA , ^ocesses has 
also been considered. See, for example, Samson (1983), Peterson, Narendra 
(1982). 

Another aspect is the extension of the adaptive control design for 
the multi inputs - multi outputs systems. Except for trivial cases, the 
extension raises important parameterization problems for MDfO plants. A 
survey of the various designs available can be found in Dion, Dugard (1983). 
More a priori knowledge on the plant structure than in the S1S0 case is 
required, and the research is directed towards the development of adaptive 
control schemes requiring less a priori structural information. The Hermite 
form of MIMO transfer matrix plays a key role in understanding the multi- 
variable case. 

The case of adaptive control of non-minisum phase plants is more 
complicated both from the point of view of the complexity of the adaptive 
control schemes and of the analysis. A survey of the adaptive control 
techniques for this type of plant is given in Landau, M'Saad, Ortega (1983). 
Most of the schemes are of "Indirect" type, and the major question to be 
answered in order to show the convergence of the system is whether the 
estimated plant siodel converges towards the isodel with satisfactory proper- 
ties (stabilizable) . Global convergence results have been obtained, but 
with the requirement of using an additional persistent excitation signal, 
see Goodwin, Teoh, Innis (1982). The robustness of the adaptive control 
designs for non-minimum phase plants with respect to model reduction and 
ill-modeled disturbances has also been studied, see, for example, 

Praly (1983). 
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ABSTRACT 

A technique is presented for obtaining a control law to 
regulate the modal dynamics and identify the modal parameters 
of a flexible structure. The method is based on using a 
min-max performance index to derive a control law which may 
be considered to be a best compromise between optimum 
one-step control and identification inputs. Features of the 
approach are demonstrated by a computer simulation of the 
controlled modal response of a flexible beam. 

I. INTRODUCTION 

A class of indirect adaptive control systems proposed for the 
control of large space structures [1] is based on a modal 
decomposition of the system dynamics and may incorporate one 
or more on-line testing schemes [2] to determine when 
successful parameter identification has been achieve. The 
control strategy used in calculating the actuator inputs must 
achieve adequate regulation or tracking performance and, at 
the same time, provide inputs to allow adequate parameter 
identification. A ont*"ol system designer is thus faced with 
the problem of ■'••'vising a control strategy to ensure 
acceptable system performance even when on-line parameter 
identi f iabil i ty tests have failed because the system 
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configuration has changed or the environment In which the 
system operates has changed. 

In this paper we formulate and examine the performance of a 
nonlinear dual -adaptl ve control scheme In which a 
sampled-data controller Is designed to select a best 
compromise between an Input signal that Is optimum for 
mean-square system regulation and an Input signal that Is 
optimum for parameter Identification. Dual control theory, 
originally formulated by Feldbaum [3,4], has been studied In 
[5-7] and in the references cited therein. A key concept 
introduced by Feldbaum is the dual control strategy based on 
a performance Index that takes Into account the fact that 
future observations on the process will be made. A 
controller may be able to "probe" the system for state and 
parameter estimation Improvement, which then may improve 
future regulation and tracking performance. In many 
situations where the dual nature of stochastic control Is not 
taken into account the controller becomes "cautious" [5,6] 
and tends to "turn-off". This undesirable phenomenon Is 
avoided by the approach described below. 

II. FORMULATION OF AN ADAPTIVE PERFORMANCE INDEX 


The dl screte-tlme dynamics for each mcde Is assumed to be 
described by the ARMA model 


y( t)+aiy( t-l) + a 2 y( t-2) = biu( t-l)+b 2 u(t- 2 )+e(t) (1) 


where y ( t ) denotes modal displacement, u(t) denotes modal 
force, and e(t) Is a sequence of Independent, 
equally-distributed, normal ( 0 , 0 ?) random variables. It Is 
assumed that e(t) Is Independent of y( t-1) ,y( t-2) , . . . , 
u( t-1) ,u( t-2) , . . . and that the parameters a 1 .a 2 .b 1 .b 2 
are unknown constants. If we let Yt denote the Information 
available to the controller at time t. 


Yt ■ |y(t),y(t-l),..., u( t- 1) ,u( t-2) , . . . | 

x ( t > denote the modal parameter vector and e(t) denote ,a 
modal measurement vector, 

x T (t) = (ai ,a2 ,bi,b 2 ) ; 

®T( t ) « ( -y( t-1 ) ,-y { t-2) ,u( t-1) ,u( t-2) 

where ( . ) T denotes vector or matrix transpose, then (1) 
may be rewritten as 

y ( t ) = e T ( t)x( t)+e( t) 


( 2 ) 


(3) 


( 4 ) 
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where the constant parameter "dynamics" satisfies 


x(t + l) * x ( t ) 


( 5 ) 


It can than be shown, following the analysis of [8], that the 
conditional distribution of x(t+2) given Y^+i Is normal 
with mean x(t+2) and covariance matrix P(t+2) where x(t) and 
P(t) satisfies the difference equations 

x(t+l) * x( t) + K( t) (y( t)-e T ( t ) x ( t ) ) 

KC t) - P(t)e(t)/(a2*eT(t)P(t)e{t)) 

P(tn) = P(t)-(P(t)e(t)eT(t)P(t))/ 

( o 2 +eT ( t)P{ t)e{ t) ) 

Furthermore, the control law that minimizes the regulation 
crl terl on 

V c ( u( t) ) * E{y2(t+l)IY t } 

Is given by 


( 6 ) 

(7) 

( 8 ) 


(9) 


u ( t ) = 


x 1 (t+l)x 3 (t*l)+P 3i (t+l))e 1 (t*l) 

x|(t+l)+P 33 (t+l) 


( 10 ) 


where denotes the sum over 1 • 1 to 4 with the value 3 
excluded. 

To provide bounded modal Inputs that Improve parameter 
Identification accuracy while guaranteeing that the modal 
amplitude will not become excessively large, the controller Is 
designed to optimize, at each sampling Instant t, the following 
performance criterion: 


ml n max C V ( x, u( t) ) ] 
u( t) x 

subject to the constraints 
u(t) ^M, 0 £ X £ 1 

where 


V( x, u{ t) ) 


V c (u(t)) 


+ (1-x) 


Vj(u(t)) 


(ID 


(12) 


(13) 
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o 

V c denotes an acceptable or desired level of regulation cost. 
Vj(u(t)) denotes and identification cost function of u(t), 

V I ( u( t) ) = trace £p ( t+2 )J (14) 

Q 

Vj denotes and acceptable or desired level of Identification 

cost. The maximization Indicated In (11) yields a function 
V ( u ( t ) ) which, although not convex. Is Interpreted as 
specifying, for each admissible u(t), the most costly linear 
combination of relative regulation and relative identification 
cost. Minimization of V(u) thus yields the modal Input that 
minimizes this most costly combination of relative 
identification and regulation performance. 

III. SIMULATION RESULTS 

Since V c (u(t)) and trace P(t + 2) are relatively simple 
functions of u(t) the numerical solution of the one-step 
optimization problem (11)-(13) at each sampling time Is quite 
feasible. Results of simulation studies described below 
illustrate an Interesting feature of this approach: since the 

parameters Involved In the evaluation of V c (u(t)) and 
Vi(u(t)) depend on system measurements, the optimum 
distribution of relative cost, x (u) depends on on-line 
measurement data and hence, at each sampling Instant, the 
weighting between identification and regulation will change 
depending on the on-line system performance. This Is In 
contrast to[9] In which a fixed weighting between absolute 
control and Identification cost Is used at each sample time. 

In the simulation study we compare the performance of three 
control systems: 

a) A constrained adaptive controller that minimizes (9) 
subject to the control magnitude constraint. 

b) An optimum Identification controller that minimizes 
(14) subject to the control magnitude constraint. 

c) The one-step dual -adaptl ve controller based on 
( 1 1 ) - ( 13) . 

In Figures 1-3 we present simulated modal response data for the 
first flexible mode of the Langley beam experiment described In 
[10] where we assume here that a single actuator Is used. The 
accumulated on-line regulation cost, VT, shown In Figure 1 Is 
defined as 


VT('l) 


£ 


rno 


(15) 
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and the on-line Identification cost, PT, Is defined as 


PT( M) * trace [P(N)3 (16) 

where P ( M ) Is calculated on-line using (8). Mote that for the 
first 10 to 15 sampling times the regulation cost of the 
dual-adaptive controller Is close to that of the constrained 
minimum-variance controller and the Identification cost of the 
dual-adaptive control system Is close to that of the 
constrained one-step optimum Identification controller. 

Figure 2 Indicates that the dual-adaptive controller's actuator 
signals switch between Its limits, ^0.5, more frequently than 
do the actuator signals of the other controllers. This may be 
due to the lack of any energy constraint In the above problem 
formulation. 

A future study will examine the performance of the 
energy-constrained dual -adaptl ve controller In comparison with 
energy-constrained minimum-variance and one-step optimum 
Identification controllers. The relative regulation cost and 
relative Identification cost defined In (13) are plotted In 
Figure 3 where 

V c (N) = a 2 N (17) 

Is the accumulated control cost that would be achieved If the 
parameters of the system where known precise <y and If an 
unconstrained control law were used; o? > 10~* was used In 
the simulation runs. A constant value Yj 0 > 10-4 was chosen 
as Indicating the acceptable level of parameter 
Identification. Figure 3 Indicates that, depending on on-line 
measurements, the one-step Identification and regulation cost 
at one sampling Instant can have widely differing shapes from 
their respective distributions at other sampling times. This 
leads to the on-line variations In the dual-adaptive control 
strategy mentioned earlier. 

The simulation results Indication that the one-step, 
constrained dual-adaptive controller has the feature of 
providing, based on measured data, system Inputs that result In 
parameter Identification while maintaining bounded modal 
amplitude response. 
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ABSTRACT 

We present a coanand generator tracker approach to model following control 
of linear distributed parameter systems (BPS) whose dynamics are described on 
infinite-dimensional Hilbert spaces. This method generates finite-dimensional 
controllers capable of exponentially staple tracking of the reference trajector- 
ies when certain ideal trajectories are known -to exist for the open-loop DPS; 
we present conditions for the existence of these ideal trajectories. An adaptive 
version of this type of controller is also presented and shown to achieve (in 
some cases, asymptotically) stable finite-dimensional control of the infinite- 
dimensional DPS. 


I. INTRODUCTION 

By a distributed parameter system (DPS) , we mean a system whose dynamical 
behavior with respect to external disturbances is described by partial differ- 
ential equations. Of course, everything is a DPS if it is carefully scrutinized, 
especially if high performance is demanded, e.g., a simple electrical circuit 
at very high frequencies. However, lumped parameter (ordinary differential 
equation) approximations often suffice to describe the system behavior of many 
engineering systems. Indeed, such approximations are necessary for DPS control- 
ler designs to be Implemented with on-line digital computers. Nevertheless, 
the distributed parameter nature of control problems should not be discarded 
prematurely; otherwise, control approaches can be generated which look good on 
paper but are not sufficiently robust to operate with the actual system. This 
has been illustrated in computer simulation and in even a few laboratory demon- 
strations of flexible structures, yet, it continues to be ignored in some parts 
of the control community. To understand the controller-structure interaction, 
a DPS viewpoint is essential . 

The most rerious difficulty of the DPS viewpoint is that it requires the 
mathematical ideas of infinite-dimensional function spaces and unbounded oper- 
ators on these spaces; for example, see [1]— [2]. Several results in the past 
have been posed within this mathematical framework with the required mathematical 
rigor [3]. Yet, the necessary practical constraints were interpreted so that 
the results would be relevant to structural dynamiclsts and control system 
engineers and would make the maximum use of their experience and intuition. 

With these ideas in mind, the concept of model following appears to be a 
procedure that yields a useful finite dimensional controller that might be 
designed taking into account the distributed nature of the system dynamics, 
whereas early model following control systems required the satisfaction of 
certain ,: Perfeet Model Following" conditions which necessitated the use of a 
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reference model having the same order as that of the process [A] , the more 
recent output model following controller or Command Generator Tracker (CGT) as 
developed by Broussard [51 allows the use of a model of arbitrary order, provided 
that the number of controls is equal to the number of outputs being controlled. 
This concept in fact served as the basis for a finite dimensional adaptive 
controller that was used for controlling large structural systems [6, 7]. 

Thus since the CGT algorithm makes it possible to use a finite dimensional 
reference model which subsequently gives a finite dimensional controller regard- 
less of the process order. This provides the basis for a direct adaptive con- 
troller which produces stable closed-loop operation with the class of linear 
distributed parameter systems considered here. The difficulties of stable 
adaptive distributed parameter control are detailed in, e.g., [8]-[9] and the 
references contained therein. In Sections 2 and 3 the nonadaptive model 
following controller is developed and analyzed; in Section A, the adaptive 
version is presented and shown to produce a stable closed-loop. Conclusions 
and future directions are presented in Section 5. 

2. PROBLEM FORMULATION 


2.1 Process Description 


The distributed parameter systems (DPS) of interest will be modeled by the 
following state space form : 


j “ Av(t) + Bf (t) ; v(o) = v o 

1 y(t) = Cv(t) 


(2.1a) 

(2.1b) 


where the state v(t) is in an infinite-dimensional 
inner product (•, •) and corresponding norm 1 1 • j | . 
operators B and C have the sam e finite rank P, and 
i nputs for P linear actuators and the outputs from 
ly. Thus, 

P 

Bf (t) = l b f.(t) 

1=1 


real Hilbert space H with 
The bounded input-output 
f(t), y(t) represent the 
P linear sensors, respective- 


( 2 . 2 ) 


and 


y(t) = [y 1 (c). 


v p (t)] 


with 


y^(t) = (Cj , v(t)) ; 1 <_ j < P 


(2.3) 


where and belong to H. 


In infinite-dimensional theory, the operator A 


is a closed, linear, unbounded (differential) operator with domain D(A) dense in 
H. Furthermore, (2.1)-(2.3) represents some well-posed physical system, which 
in mathematical terms is the weak formulation of (2.1): 
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(2.4) 


t 

v (t ) = U(t) v q + / U(t-r) Bf(T)dT 
y(t) = Cv(t) i t > 0 

where is any initial state in H and U(t) is the C^-semigroup of bounded 
operators generated on H by A, This latter means: 


U(t+x) = U(t) U(x) ; 

t > 0, T >_ 0 

(2.5a) 

U(o) = I 


12.5b) 

lim (U(t) - I] v = 0 
t-K) + 

; v in H 

(2.5c) 

Av = [lim ^ )v ; 

v in D(A) 

(2.5d) 


t-0 + 


Note that the semigroup U(t) evolves the initial conditions v^ forward in time. 

When v is in D(A) and f(t) has continuous first derivative, v(t) also is differ- 
entiab?e, lies in D(A) for t >_ 0, and satisfies (° 1). However, any v q and H 

and any square-integrable f(t) wili satisfy the weak formulation (2.4) and yield 
states v(t) in H for all t ^ 0. Consequently, (2.4) is easier to work with in 
infinite-dimensions and is more likely to represent the actual physical system 
being modeled by (2.1). This form, (2.1) or (2.4), models most practical 
interior control problems for linear DPS where the actuator and sensor influence 
functions are given by b^ and c , respectively. 


For example, control of the damped wave equation on a region OCR 
single actuator and sensor is described by (for e > 0) : 


by a 


t-aiflL ♦ t . auu.o . b(l) f(t ) 

dx 

y (t) = / c(x) u(x,t) dx 

a 


(2.6a) 


(2.6b) 


where u(x,t) is the displacement from equilibrium of ft and the influence func- 
tions b and c can be taken as approximations of Dirac delta functions at the 
location of the actuator and sensor. 


A u(x,t) = l 
° l«l 


3 u(x,t) 
3x 2 „ 


The operator A q is the Laplacian given by 

(2.7) 


on D(A ) = {u(x,t)eH ju(x,t) is smooth and u(x,t) ' 0 on the boundary of Q}. 

O O A 

The domain D(A ) is dense in H = Ir(fl) with the usual inner product (♦,*). 

o o ° 

This can be put into the form (2.1) by choosing the state v(t) * {u(x,t). 
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SuU.Q .T 

3t J 


in H = D(A 1/2 ) 
o 


x with the energy inner product : 


(v, U ) = (A c 1/2 v r A o l/2 Wl ) o + (v 2 , « 2 ) o 


( 2 . 8 ) 


The operator A in (2.1) becomes 


A = 



I 

-el 


and the rest follows. 


(2.9) 


Another important example is the mathematical setting for large structural 
systems (LSS) which may be described as a continuin by the following system of 
partial differential equations: 


m(x)u tt (x,t) + D Q U t (x,t) + A Q u(x,t) - F(x,t) (2.10) 

where u(x,t) represents a vector of instantaneous displacements of the structure 
0 from its equilibrium position due to transient disturbances and the applied 
force distribution F(x,t). The displacements can be translational and rotation- 
al, and the forces can be generalized to include torques, as well. The mass 
density m(x) is positive and bounded on ft. 


The internal restoring force term A q u is generated by a time-invariant, 
symmetric, non-negative differential operator A^ appropriate to the LSS. The 

domain D(A ) of A contains all smooth functions satisfying the LSS boundary 
conditions°and is°dense in the infinite-dimensional Hilbert space * L (ft) 

with the usual inner product (*,*) o and associated norm ||*|| o . In most cases, 

the operator is assumed to have discrete spectrum, i.e.. Isolated resonances; 

this can be expressed by the following eigen-problem: 

Vk = w k *k (2 - n) 

where w^ are the vibration mode frequencies and are the corresponding 

vibration mode shapes . Of course, exact expressions for this modal data are 
rarely known for an actual LSS . 

The damping term D Q u t is composed of a skew syvnetrlc part, which repre- 
sents gyroscopic damping due to any on-board rotors or constant spin rate of 
the whole LSS, and a small symmetric part which represents the internal struc- 
tural damping and is thought to provide very low mode damping. 

The applied force distribution is 

F(x, t) * F (x, t) + F (x, t) (2.12) 

c D 
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where F Q represents the external disturbance forces on the LSS (and possible 
nonlinearities) and F £ represents the control forces due to P actuators: 


F 

c 


B f 
o 


P 


l 

i-1 


b i (x) f i (t) 


(2.13) 


where the actuator amplitudes are f^t) and the actuator influence functions 
are b^x) in H q . These are usually localized or point devices so that they 
approximate fi(x-x^); however, they do not have to be point devices. 

Observations are obtained by P sensors 


y = c 0 u + E 0 u t (2.14) 

where y.(t) = (c^ ,u^) + (ej,u t ) o , 1 <_ j P, with influence functions c^ for 
position sensors and e^ for velocity sensors in H q . Again, these are usually 
localized or point devices but they do not have co be. 

The LSS dynamics are defined by (2.10) and (2.14) can be put into the 
inf inite-dinensional state space form : 

| - Av(t) + Bf (t) + rf D (t) (2.15a) 

{ y(t) = Cv(t) ; v (o) = v^ (2.15b) 

with (A,B,C) as in (2.3) and the persistent disturbance term Tf p (t) obtained 

from F Q in (2.12). Impulsive disturbances in the structure are modeled by the 

initial condition v . 

o 

The Kille-Yosida Theorem (e.g. [1], Theo. 8, 9, p. 153), provides condi- 
tions under which an operator A generates a (^-semigroup U(t) satisfying: 

| |u(t) | | £ Ke _0t , t > 0 (2.16) 


where K >_ 1 and o real. The necessary ^nd sufficient conditions are given for 
the resolvent operator R(X,A) = (XI-A) : 

| ! R(X , A) n | | < - K — ; n = 1, 2, ... (2.17) 

(X+o) n 

for all real X > - o in the resolvent set of A, p (A) * £X complex |r(X,A) is a 
bounded operator on H} . The spectrum of A, o(A) = p(A) is much more compli- 
cated in infinite-dimensions, but, in finite-dimensions, it consists only of 
the (finite number of) eigenvalues of A. We say that A is exponentially stable 
when o > 0 in (2.16), i.e., the semigroup U(t) generated by A decays exponen- 
tially at the rate a. There are many other types of stability in infinite- 
dimensions, but no others provide the safety of a stability margi.i o; therefore. 
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this seems to be the kind of stability of most practical interest for engineer- 
ing applications where there is always some uncertainty in the model of JPS. 

2.2 Model Following Control Problem Formulation 

Given the DPS as defined in (2.1), it is desired to find a finite dimen- 
sional controller so that the output y(t) "follows" a desirable output trajec- 
tory y m ( t - This output trajectory is to be generated by the finite dimensional 

(asymptotically) stable reference model: 

« = V + Vm (2 ‘ 18a) 

= c B q ; ^<°) = % (2.i8o) 

where 

q is the model state vector having dimension N, 

u q is a step or reference level command with dimension P, 

y^ is the output trajectory also having the dimension P, 

and A , B are matrices with appropriate dimensions. It should be noted that 
mm 

the dimension of both y and u is the same as the dimension of the process 

to m 

input f and the process output y as defined in (2.1). Usually q * 0 will be 
chosen. 

The output model following control problem to be solved is the development 
of an algorithm that defines the process input f(t) so that the following two 
model following conditions (MFC) are satisfied: 

MFC 1) If yCtp = y m (t 1 ) , then 

y(t) = y m (t) . f »r t > t 1 

MFC 2) If y(t.) 4 y (t.), then 

y(t) asymptotically will approach y m (t), i.e. 

Aim [y(t) - y (t)] - 0 

t-x» 

3. DEVELOPMENT OF THE NONADAPTIVE MODEL FOLLOWING CONTROLLER 
3.1 Solution Definition 

In a manner similar to Broussard's development of the Command Generator 
Tracker (CGT^ [5], the concept of an ideal state v , control f and output 
trajectory y will be introduced. It is required that these trajectories 
satisfy the process dynamics (2.1) and that the ideal output y be identical 
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tc the model output y . Thus: 
r 

= Av (t) + Bf *(t) 

* * * * 

y (t) = Cv (t) ; v (o) = v 

o 

* 

where the ideal state v (t) is (as with v(t)) 
Hilbert space H. 


(3.1a) 

(3.1b) 


in the infinite dimensional 


Furthermore 

y*(t) - y m (t) = c m q(t) (3.2) 

In a manner similar to that in [5], it will be assumed that v (t) and 

* 

f (t) are linearly related to the model state vector q(t) and command vector 

u (t) as follows: 
m 


V (t) 
* 

' A 11 q(t) + S 12 u m 

(3.3) 

f (t) 

‘ S 21 q<t) + S 22 u m 

(3.4) 


The bounded linear operators S^, S 19 , will not be determined to 

satisfy MFC 1. 


To this effect, differentiation of (3.3) with respect to t and substitu- 
tion of (3.1) and (2.18) gives: 


9v (t) 
at 



11 


A q + S 
m 


11 


B u 
m m 


* 

Av 


+ Bf 


* 


(3.5a) 


where 


v = q + S. u 
o 11 n o 12 m 

is in D(A). 

Replacing v and f on the right side of (3.5) by (3.3) and (3.4) gives: 


(3.5b) 


s n V : £ n V. 

A < S ll1 + S 12 "»> + B < S 21 q + S 22 V (3 - 6) 

Now since (3.6) must be valid for all q and u m> it is necessary that: 

S.. A * AS,. + BS.. (3.7) 

11 m 11 21 

S 11 B » ’ AS 12 + BS 22 (3 - 8) 
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(3.9) 


Finally the incorporation of (3.2) yields 
y*(t) = CS n q + CS 12 U m = y m = C^q 

Thus : 


CS 11 = C m 

(3.10) 

CS 12 = 0 

(3.11) 


In summary then eqs. (3.7), (3.8), (3.10) and (3.11) must be^solved in 
order to find S 0 ^ and S ^ which in turn define the ideal control f of Eq. (3.4), 

Recall however, that both MFC 1 and MFC 2 must both be satisfied. In 
order to satisfy MFC 2, it is useful to consider the equation for the error 

e * v* - v (3.1?) 

* 

which is in D(A) when v and v are both in D(A). Differentiation of (3.12) 
with respect to time gives: 

* 

3e _ dv 
at " at 

= Av + Bf - (Av* + Bf*) 

= Ae + B(f - f*) (3.13) 

This equation suggests that the actual model following control f be defined as: 
f - f* + G(y - y m ) 

•k it 

= f + G C(v - v ) 

= f* + G C e 0.14) 

Substitution of (3.14) into (3.13) gives: 

e = (A + B G C)e (3.15) 

P P 

where G:R ->R is a bounded linear operator. Thus if G is chosen such that 
(A + B G C) generates an exponentially stable C -semigroup, then the control f 
as defined by (3.14) will satisfy the conditions for model following. 

It i9 important to note that this controller is clearly finite dimensional. 
For implementation it is only necessary to "build" a finite dimensional refer- 
ence model and form the proper linear combination of its state vector and command 
vector. The gain operator G is also finite dimensional and should be chosen 
such that the decay of any transient caused by initial plant model output error 
is sufficiently fast. We summarize the above discussion as 
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Theorem 1 : If (A,B,C) is exponentially output stabilizable and there exist 

bounded linear operators S ll* S 12 * $21* and S ^2 such that (3.7) - (3.8) and 

(3.10) - (3.11) are satisfied, then the model following control (3.4) an£ (3.14) 

satisfies the model |ollowirg conditions MFC (1) and (2) and lim [v(t)-v (t)] = 

0 when both v and v belong to D(A). t-* 30 

o o 

FroT" [10], ue see that (A,B,C) is exponentially output stabilizable if and 
only if 11^ = N(C) X and H R = N(C) form a pair of stabilizing subspaces for (A,B). 

Note that dim = P which is the number of sensors (or actuators) used. The 

conditions for existence of the ideal trajectories (3.1) will be developed in 
the next subsection. 

3.2 Existence of Ideal Trajectories 

* 

The existence of ideal trajectories v (t) for the DPS (2.1) is determined 

by solutions S. 4 to the operator equations (3.7) * (3.8) and (3.10) - (3.11). 

1 J 

These can be rewritten as 


A 

B 

s n 

S 12 

C 

0 

- S 21 

S 22- 



“ S ll 

0 


A 

m 

B 

m 


0 

I_ 


C 

m 

0 


(3.16) 


N P 

where S - :R -*D(A) and S ’.R -*D(A) are bounded operators with finite-rank and 
N 1 p P P iZ 

$ 2 i : ^ "* R and S ">2 :R "* R are matT ^ ces appropriate dimension. Note that 

(3.16) describes a kind of aggregation (in the sense of Aoki) for the infinite- 
dimensional system (2.1) injo a finite-dimensional system (2.17). The existence 
of the ideal trajectories v (t) in (3.1) guarantees that such an aggregation is 
possible, i.e. the DPS (2.1) generates the ideal trajectories which correspond 
to those of the finite-dimensional model (2.18). 


In most situations, the ideal initial condition will be v =0; hence, 

from (3.5b) we would choose q =0 and S 10 “ 0, which correctly corresponds to 

o Iz 

(3.11). This reduces the other operator equations to the following: 


S 11 A m A S n + B s 21 

(3.17a) 

S 11 B m " B S 22 

(3.17b) 

C S 11 = C m 

(3.17c) 


we have the following: 


Theorem 2 : If the spectra c(A) and a (A ) are separated by a smooth simple 

closed curve r containing o(A ) in its in?arior and a (A) in its exterior, then, 

m N P 

given any linear operator S-. :R -*-R , there exists a unique bounded linear 
N 

operator S^:R -*D(A) given by 
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(3.18) 


S ll q = 2^1 / R(X ’ A) B S 21 R(X »V qdX 


lor any q m R 


PROOF: From (3.17a), it follows that for any Xea(A)0 a (A ): 

m 

S 1t R(A,A ) - R(A,A) B S_ R(A,A ) - R(A,A) S 1t 
ii m 21 m 11 

But integration of (3.19) over the curve T produces: 


(3.19) 


0 - 2^1 J R ( X > A ) : n T dX " 27f / lS ll R < X »V q ' R(X * A) B S 21 R 0 *V qldX 


= S xl q - I R(A,A) B S„ R(A,A rt )q dA 


2-rri 


21 


because r encloses the finite number of singularities of A and excludes all of 
the spectrum of A. Clearly, since R(A,A) :H-*-D(A) , must°have its range in 

D(A), and this is the desired result. # 

Once, we have specified the matrix the unique operator is deter- 

mined. Satisfaction of (3.17c) could most easily be done by defining C to be 

m 

C S^. The determination of the matrix S ^2 for (3.17b) could be done from 

S._ = (B*B) _1 B* S., B (3.20) 

22 Ij. m 


as long as B is c sen so that a solution exists. Note that the operator B has 
10 * 
full ran* P and so the inverse of B B exists. 


Although the above existence result does not really require the number of 
actuators and sensors to be equal, this will be needed in the later sections. 
Also, the following alternative existence result requires it: 


-1 P 

Theorem 3 : Let zero belong to p (A) and C A B be nonsingular on R , then 


A B 

-l ] 

"®H n !2 _ 


A _1 (I-B(CA" 1 B) -1 CA“ 1 ) • A -1 B(CA“ 1 B)" i 

c o] 


_«2i ^ 22 - 


| 

_ (CA“ 1 B)" 1 CA“ 1 j -(CA _1 B) _1 


and S 


'll 


12 

satisfies: 


fi ll S 11 V S 21 


°21 S 11 A m + 


n 22 C m’ and S 22 


“21 S 11 ®m Where 


11 


n,, s.. a + ft. 9 c 

11 11 m 12 m 


(3.21) 


The proof of Theo. 3 can be obtained by straightforward computation using (3.16). 
Furthermore, note that 
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AS 


11 


Aft 


11 


S 11 A m + ^12 


C 

m 


- <i-«n 12 > 


S U A „ + <- Bn 22> C » 


■ S 11 A „ - B l°12 S U A * + °22 C J 


S 11 A m " BS ;i 


which is the same as (3.17a); however, Theo. 3 gives a wider range of solutions 
than Theo. 2 since S^ 2 need not be zero . The solution of (3.21) can be handled 

when zero belongs to (A ) because we then have the following: 


s n A 
11 m 


-1 


= ft 


11 11 


+ ft 


12 


C A 
m m 


-1 


(3.22) 


which has a unique solution whenever the o(A m ^) and a(fi.^) are separated 
by a smooth simple closed curve (see proof of Theo. 2). 


4. THE ADAPTIVE MODEL FOLLOWING CONTROLLER 


4.1 Development of the Adaptive Controller 

The nonadaptive control law (3.14) requires exact knowledge of the gain 

operators G, S ?1 , and S ?2 * These may be known to exist via mathematical structure 

of the DPS (A,B,C) in (271) (e.g. Theos. 1, 2, 3) but they may not be available 

in an explicit form. Consequently, we would need an adaptive version of (3.14): 

f(t) = S n (t) q (t) + S 22 (t) u m + G(t) e y (t) (4.1) 


where 



We assume throughout Sec. 4.0 thaj the hypotheses of Theo. 1 are satisfied 
for the DPS (2.1). Take e(t)* = v(t) - v (t) and, from (2.1), (3.1), (3.3) and 
(4.2), obtain (for v q and V q in D(A)): 


where 


Se(t) . 
8t 

e (o) 5 


A e(t') 4 BAK(t) 

c 


e 

o 


v 

o 


* 

V 

o 


r(t) 


A r A + BGC generates an exponentially stable C -semigroup U (t) and 


r(t) 


e y (t) 

q(t) 


u 


Ml Op 

belongs to R and AK(t) = K(t) - K q where 


m 
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K(t) = (G(t) j S (t) j S 22 (t)] and K q = [G j S J S ] 

II I I 

adaptive gain laws we sh*ll use ar: motivated by [6] ana have the form: 
K(t) * Kj(t) + K p u) 0 

K p (t)z - -r p e y (t) (r (t) ,z) ( t 

. K x (t)z - -rj" 1 e y (t) (r(t) ,z) (t 

» _ dK I N+2P 

where , z belongs to R , and T p , are both positive definite 

matrices on R . Note that (since is constant) : 


AK x (t) = K x (t) = -fj' 1 e (t) (r(t) , •) 


(4.5) 


where 


AK x (t) = r 7 (t) - k q . 


The closed-loop adaptively controlled DPS is given by (4.3) and (4.5): 


8e(t) 

3t 


A £ e(t) + F (t, e(t)) 


e (o) = e 5 
o 


K x (o) 


(4.6) 


where 


e(t) = 


F(t,e(t)) E 


AK x (t) 


BAK(t) r(t) 


A 0 
A = C 

c L° 


, and 


e (t) 
y 


. with e (t) * C e(t) and r(t) « q 

- r i V° ('(«>• ’ll y | u 


The state e(t ) of (4.6) resides in a new Hilbert space H where H = H x B (k , 
P 2 

R ) with (H, jHj) representing the Schmidt class of compact linear operators 

from into Hj with inner product (A,B) = tr A E where ,: tr M denotes the trace 

of the operator; eee [11] pp 262-264 for further details. The inner product on 
H is formed by summing those of H and we shall use the same symbols for ail 
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inner_products_( • , •) and their corresponding norms j|*||. The nonlinear func- 
tion F(t,*)‘fr*H is continuous; hence, 

e(c) = U(t) e Q ; t ^ 0 (4.7) 

where U(t) is the nonlinear semigroup defined on 11 by (for any h in H) : 

U(t)h = U^(t)h + J U c (t-t) F(T„U(T)h)dr (4.8) 

where _ 

[U (t) 0 

U (t) = is the linear C -semigroup generated on H by A in 

c Lo ii 0 c 

(4*6), The above follows from [12] Lensna 5.2 p. 186 where further details on 
nonlinear semigroups are also available; consequently, the closed-loop infinite- 
dimensional system (4.6) xs well-posed on H. 

4.2 Closed-Loop Stability 

The stability analysis of the nonlinear infinite-dimensional system (4.6) 
requires the extension of Lyapunov theory to infinite-dimensional spaces. This 
has been done in [12] -[13] and we summarize the necessary elements here: 

Def : The equilibrium point $_is stable for the system _(4.6) if for every e > 0 

there exists t > 0 such that j je(o) - VJ \ < <5 implies ] j e (t) - t for all 

t ^ 0. If, in addition to stability, there is a y > 0 such that j |e(o) - <J> | | < 

Y implies lim | |e(t)-$| | = 0, then ♦ is said to be asymptotically stable for 

t'X® 

(4.6). Usually we can take 4=0. We say an equilibrium point is unstable 
whenever it is not stable. 

"?ef : t A continuous functional V:H-*R is a Lyapunov function for (4.6) if V(o) = 0 
and V(e) £ 0 for all e in H wnere 

V(e) = lim sup (4. 9) 

t-K) + 

where e is in H and e(t) = U(t)7 a 9 given in (4.7). 

Lemma 1 : If V:H+R is a Lyapunov function for (4.6) with the property that 

vG) 1 f 1 (i|e||) (4.10) 

for all e such that ||e|| ^h (where 0 < h < “) and f^ is of class (i.e. 
f ,:(0,taj->-R + with f^(o) = 0 and f^ strictly increasing on [0,h], then the zero 
equilibrium point is stable for (4.6). 

Lemrar 2 : If in addition to the hypotheses of Lemma 1, the Lyapunov function V(*) 

has th property: 
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V(e) <_ -W(e) for all e in H 
W(e) >_ f _ ( | I e j | ) for ||e|| <^h 


(4.11b) 


where is also of class M. , then the zero equilibriua point is asymptotically 
stable hr (4.6). * 

The proofs of Lemmae 1 and 2 can be found in fl3J. These results constitute 
Lyapunov's Direct Method on infinite-dinenaional spaces. 

We now have the following stability result for our adaptively controlled 
closed-loop system (4.6): 

Theorem 4 : Assume the following: 


(a) In (4.3), A c = A + BGC satisfies 

(A v, Pv) + (Pv, A v) * -(Qv,v) 

c c 


(4.12) 


for all v in D(A) where P and Q are symmetric positive operators on H such that 
(for some ci, S positive constants): 


I I v | | 2 1 (v,Pv) <6 | |v| | Z 

u|lv| | 2 <_ (Qv,v) (i.e Q is coercive) 


(4.13a) 

(4.13b) 


for all v in H, 
(b) B*P = C, 


(4.14) 


(c) the hypotheses for Theo. 1 are satisfied, and both v q and v q belong to D(A) , 


then V(e) = (e,Pe) + (AKj, Tj AK^)» with AK^t) 5 K^t) - K q and e = |^|» is a 
Lyapunov function for (4.6) and the zero equilibrium point is stable. 


PROOF : 


Recall that 


AK(t) = AK^t) + K p (t) 
AK I (t) = K x (t) 


(4.15) 

(4.16) 


Now, clearly V is a continuous functional from H into R (due to (4.13a) with 
V(o) * 0. Furthermore, since V is a quadratic functional, it is Frechet 
differentiable. Hence, from (4.6) and (4.12), 

V(e) = -(Qe,e) + 2p (4.17) 

where p = [(Pe, BAKr) + (AK^.F^ A*K^)] 

From (4.16), (4.4c), and (4.15), we have 
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p = (B*Pe, AK r ) - (AKj, e y (r,-)) 

= (B*Pe , AK r ) - (r, AK^ e y ) 

= <B*Pe, A^r) + (B*Pe, K p r) - (r, AK* e y ) 

= (AKjr, [B*Pe - e y ]) + (K p r, B*Pe) (4.18) 

* * 

where we have used (A,B) = tr A B = tr(BA ). Furthermore, using (4.14) in 
(4.18), yields 

V « (K p r, e y ) = -(r p e y )||r || 2 (4.19) 

from (4.4b). Consequently, using (4.19) in (4.17), we obtain 
»W‘ -KQe.e) + 2(r p e y . e y )l|rl| 2 l 

I- MM I 2 ♦ 2 « p l!e y |! 2 ||r|| 2 ) < 0 (*.20 

where a = A _ (T ) and we have used (4.13b). 
p min p 

Also, using (4.13a), we have 
V(e) > | | e | | 2 + A nin (r i ) | | AK X | j 2 

In other words, f, ( 4 ) = [1 + X . (r 
l min 

abo^e satisfies the hypotheses of Lemma 1 and the desired result is obtained. # 

Note that the use of a proportional adaptive gain (4.4b) produced the second 
term in (4.20): however, this term is not essential and the above argument 
could be simplified by omitting (4.4b) from the adaptive gain laws. 

The hypotheses (a) and (b) correspond to the Kalman - Yakubovich conditions 
in infinite-dimensional spaces. From [13] Theo. 4.7, if for some real w, 

(Av, v) u>| | v | | for all v in D(A), then exponential output stabilization of 

(A,B»C)would be equivalent to satisfaction of hypotheses (2); however, there 
would be no guarantee that ? and 0 could be found in (4.12). such that (4.14) 
could be obtained. 1° finite-dimensional spaces, the Kalman - Yakubovich 
conditions are equivalent to the strict positive realness of the transfer 

function T c (s) = C(sl-A c ) ^B, i.e. Re T c (ju) > 0 for all real 01 ; see [14] pp. 

115-118. A number of papers, e.g. [15] - [17], have been written on this 

relationship in infinite-dimensional spaces. For example, [17] asserts that 

ReT (jw) must be coercive*which would be quite a bit stronger than what is 
c 

required in finite-dimensions. This is an area that requires further investi- 
gation. 

As pointed out in [9], we cannot immediately conclude asymptotic stability 
from (4.20) since it does not satisfy the hypotheses of Lemma 2. In finite- 
dimensional space, we could apply the LaSalle Invariance Piinciple to obtain 


j)]4 which is of class Therefore, the 
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asymptotic stability as is done in [6]; however, in infinite-dimensional spaces, 
it is not the case that '’bounded sets are precoapact** and this is essential for 
the LaSalle result. 


The following result ([13] Theo. 5.4 p. 188) may be helpful: 

L emma 3 : Let in (4.6) generate the linear C^-semigroup U^(t) on E and F 

is any_bounded x continuous function such that (4.6) generates a nonlinear semi- 
group U(t) on H (as given in (4.8)), then all bounded orbits of (4.6) are pre- 
compact if either 


(a) U q ( t) is compact operator for all t 0 
or 


(b) U c (t) is exponentially stable and the function F is compact (i.e. maps 

bounded sets into precoopact ones) 

A 0 

— _ c 

Due to the form of A = , it is not possible to satisfy (b); however, 

c [0 0 

(a) may be satisfied, for example by operators A which generate holomorphic 
semigroups. This latter is determined by the form of damping operator in a 
flexible structure. Again, this is a topic for further investigation. An 
alternative adaptive gain lav: 


KjfOv = -r 


-1 


(e y (r,v) + K I (t)v) 


(4.21) 


yields : 

V(e)< -[a | | e | | 2 + 21'iAKjjl 2 + 2( AKj, K q )] 


which does not quite give asymptotic stability but might be modified to do so. 

5 . CONCLUSIONS 

In this paper, we have presented a direct adaptive controller for linear 
distributed parameter systems (DPS) described on infinite-dimensional Hilbert 
spaces. The controller is based on a command generator tracker approach used 
in finite-dimensional spaces, e.g. [6] where it is shorn to be asymptotically 
stable. We have shown here that, under certain conditions on the open-loop 
loop DPS, ideal trajectories do exist and the adaptive controller is stable, 
i.e. the output and gain errors remain bounded. If the further condition that 
A in (2.1) generates a holomorphic C -semigroup is imposed, then we can also 
conclude asymptotic stability which guarantees asymptotic tracking or model 
following. 

A number of issues have been opened for further investigation: 

(1) use of dynamic rather than output feedback stabilization; 
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(2) generation of asymptotic ideal trajectories by the open-loop DPS; 

(3) connections between the Kalman-Yakubovich conditions and the input- 
output description of the DPS; 

(4) development of alternative adaptive gain lavs which produce asymptotic 
stability of the closed-loop system; 

(5) exploration of reasonable conditions under which LaSalle's Invariance 
Principle can be used to determine asymptotic stability of the closed- 
loop system. 
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SELF-TUNING ADAPTIVE CONTROLLER USING 
ONLINE FREQUENCY IDENTIFICATION 

W. W. Qia« and R. H. Cannon, Jr. 

Stanford University 
Palo Alto, CA 94304 


ABSTRACT 

A real-time adaptive controller has been designed and tested successfully on a fourth order laboratory 
dynamic system which features very low structural damping and a nou-colocated actuator-sensor pair. The 
controller, implemented in a digital minicomputer, consists of a state estimator, a set of state feedback 
gains, and a Frequency-Locked-Loop (FLL) fur real time parameter ideatiieatioa . The FLL can detect 
the closed-loop natural frequency of the system being controlled, calculate the mismatch between n plant 
parameter and its counterpart in the state estimator, and correct the estimator parameter in real time. The 
adaptation algorithm can correct the controller error and stabilize the system for more than 50% variation 
in the plant natural frequency, compared with a 10% stability margin in frequency variation for a fixed-gain 
controller having the same performance at the nominal plant condition. After it has locked to the correct 
plant frequency, the adaptive controller works as well as the fixed-gain controller does when there is no 
parameter mbmatch. The very rapid convergence of this adaptive system is demonstrated experimentally, 
and can also be proven with simple root-locus methods. 

L INTRODUCTION 

A controller using Kalman filter and fall state feedback usually has good performance, provided a 
very accurate model of the plant is known. But such controllers are very sensitive to parameter variation, 
especially when the plant has very low inherent damping, aad when the sensor is not colocated with the 
actuator. 

A two-disk laboratory model, consisting of two inertia disks connected by a torsion rod, which has a 
structural damping of 0.004, and with separated sensor and actuator locations was constructed to test several 
adaptive controller designs. The form of the equations of motion of the model is known due to the ease of 
analysis of the lumped system; but the lack of accurate knowledge about the natural structural frequency 
during controller design corresponds to a plant parameter uncertainty or variation; and this uncertainty is 
what the adaptive controller handles. 

It has been proposed by Kopf, Brown, Marsh (Rell) and Mac ala (Ref.2) to use a Phas: ,-ocked-Locp 
to implement tuned damping and notch filtered command torque, so that the feedback control force near the 
structural frequency can be adjusted properly according to the natural frequency of the plant, Rosen*:*al 
and Cannon (Ref.3) have implemented such a kind of controller for the two-disk experimental system. 

Under the same research project, a different approach using a Frequency-Locked-Loop (FLL) to identify 
the plant frequency was developed. Thi' oaper describes in detail how the FLL identifies the unknown plant 
parameter and updates the controller in real time. 
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H. DESCRIPTION OF THE TWO-DISK PLANT AND FIXED-GAIN CONTROLLER 


The plant to be controlled is a mechanical system which consists of two horiaental steel disks connected 
by a vertical elastic steel rod. The two disks are supported by bearings which allow rotational motion only. 
A low-friction DC motor is attached to the lower disk, and an RVDT sensor detects the angular position of 
the upper disk. 

If structural damping is neglected*, the state equation of motion of this system can be expressed as 



where z v and are the position states of the rigid body mode and the structural oscillation mode respec- 
tively, x 3 and are rates of those states respectively; u/* is the natural frequency, J is the total moment of 

inertia of the two disks, and u is the control torque from the DC motor. 

The sensor output b 

Jf - *1 + *i- (2) 

A first-order high-pass filter with 100 He cutoff frequency b used to differentiate the position sensor output 
and provides the pseudo-rate of the top disk. 

If all the parameters of the plant are known accurately, an LQG design (Ret4) will result in a set of state 
feedback gains G for regulation and estimator gains L for state estimation. However, if the plant natural 
frequency w» b not known by the controller designer, and a value u;« b used in the estimator, the stability of 
the whole system has to be analysed by augmenting the system state equations with those of the estimator 
states, and finding the modal frequencies and dampings of the system (ReL5) 

Using the same penalty weightings for control effort and state errors, an LQG design produces different 
feedback gains C and L for different natural frequencies u/ m of the plant. Analysb shows that the stability 
of the whole system is less sensitive to those feedback gains than to the parameter u/« used in the estimator, 
since an error in the latter parameter corresponds to a modeling error, while variations in the former ones 
correspond to different weightings in the LQG design process. In the experiment described here, feedback 
gains G and L are chosen for the nominal plant frequency, and are kept constant in order to demonstrate 
the adaptation of the controller by correcting w e in the estimator. 

Trom the analysb of the augmented system state equations, the frequency u/ e of the most unstable 
c .used-loop mode can be found as a function of w» and if all other parameters are kept constant. Thb 

'Udction. 

( 3 ) 

will asf< t the closed-loop performance of the adaptation process, and has to be taken into account in the 
design process. The two-disk model has a nominal frequency of 13.3 rad/sec, and the function described in 
eqa&tio 1 (3) can be shown approximately as in Fig. 1, *zd enn be approximated as 

w e - w. = (w„ - w«) +0.8 (4) 

for /u„ - w*/ < 1.5rad/sec. 

m. FREQUENCY IDENTIFICATION USING FREQUENCY-LOCKED-LOOP 

A Phase-Locked-Loop (PLL) was initially proposed to be used to detect the vibration frequency. PLLs 
have been used widely in locking onto high-frequency signals in electrical engineering applications, but it 

• It is actually 0.004 
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is only beginning to be used in locking onto low-frequency signab in mechanical systems. A PLL has the 
ability to identify the phase and frequency of a signal contaminated by a relatively large amount of noise at 
other frequencies. Several signal components at different frequencies can be identified by using several PLLs. 

The traditional PLLs are nonlinear elements for which the performance is hard to analyse and predict; 
and they have limited locking ranges due to their nonlinearity. Besides, PLLs are more sensitive to the 
phase than to the frequency of their driving signal, which makes them unsuitable for frequency identification 
because the identification will be disturbed by the phase in the sensor signal every time a new position 
command or an external disturbance is applied to the system, even though a PLL has identified the correct 
plant frequency already. 


A modification is made to a PLL to eliminate its sensitivity to phase in the input signal and make the 
input /output relation linear in a larger tracking range, so that it works better for frequency identification, 
while retaining the other virtues of PLLs. The final product, called a Fkequency-Locked-Loop (FLL), is 
shown schematically in Fig. 2, and its input/output relation can be seen from the functional block diagram 
in Fig. 3, where w, is the frequency of the input signal and u e is the output signal - the frequency detected by 
the FLL. Abo shown in the same block diagram are w„ the starting oscillation frequency; Aw, the correction 
on the output; and w er , the error of the output of the FLL. 


The character of the block G(i) can be chosen arbitrarily by the designer as long as it can update the 
output frequency of the FLL according to its error w tr . If a simple integrator j is chosen as the element 
<?(*), then the FLL will have a pole at -K where 


K = G(c -b) 
ab 


( 5 ) 


Parameters a and b should be determined with the following restriction 


> o > 6 > /w, - u/ 0 /. 

(«) 

In the present case, 


w* = l$.3rad/sec. 

(?) 

and the linear search range is chosen to be 


/u/g - Uq/ m ~ = 3.3rad/sec. 

(8) 


The pole location « = —K should be determined as the result of a compromise between speed of response 
and nobe rejection, at the nominal locking frequency range. In this case, the parameters of the FLL are 
chosen as 


a = 6.0, 6 = 4.0, G = 20.0, 


K = 1.67, 


(9) 


to work in the range of 1 to 3 He. 


With parameters chosen as above, the block diagram in Fig. 3 can be simplified to the transfer function 


<?(•) = 


Oo(s) 

n,(«) 


K 

(• + *)' 


( 10 ) 


Fig. 4(a) shows the test result of the FLL output when the frequency of the input signal b changed stepwisely. 
The response for small input change (the first change in Fig. 4(a) ) b similar to the step response of a first- 
order filter with pole at -K, as shown in Fig. 4(b). The response tor a larger input change (the second 
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change in Fig. 4(a) ) experienced some nonlinearity at the beginning bectuse its internal structure is not 
linear; however, the FLL still tracked the bpnt signal and provided the correct ontpnt in a reasonable time. 

IV. CORRECTION OF PARAMETER ERROR IN THE CONTROLLER 


Because eigenvalues are properties of the system, they are independent of the instantaneous value of 
state variables and are influenced only by changes of parameters. The relation between u t and w e , as shown 
in Eqn. 4, can be expressed as in Fig. 5. Using the difference between (w 0 - 0.6) and w, to update - 
through the integrator * - the parameter u, in the controller, the closed loop dynamics of the parameter 
variation, identification, and correction can be expressed as in Fig. 6. The characteristic equation of the 
closed parameter adaptation loop is 


1 + 


HK 

(# + *)(• + *) 


0 , 


( 11 ) 


or, 

*(• + JT) + (# + 2K)H s 0, 


( 12 ) 


which can be written in Evan’s form as 


«(« + *) _ 
(• + 2 K) 


(IS) 


The root locus of Eqn. 13 vs. the positive value of H with K — 1.67 is shown in Fig. 7, and the value cf 
H = 9.9 is chosen obviously to maximise the adaptation rate. The change of the slope in Fig. 1 corresponds 
to a variation in the gain in Eqn. 4, and Eqn. (11) can be modified as 


tHK 

1+ (# + *)(# + *) 


= 0 , 


(14) 


where 2 > r > 0 , and the root locus shown in Fig. 8 guarantees the stability of the system over the range 
of the gain V. 


Any sensor measurement, controller state variable, or linear combination thereof can be chosen as the 
input signal to drive the FLL, so long as the signal contains the modal frequency of interest (th> larger the 
better!). The error between the sensor rate and the estimate of it is chosen to drive the FLL, since there is 
less error signal if all parameters in the controller are correct. 

The FLL must be turned off if its input signal is too small, in order to reject the influence from random 
noise. 


A PDP-11/23 minicomputer was used to implement the controller and the FLL at 25 He sample rate. 
The test results of this adaptive system are summerized in the following section. 


V. EXPERIMENTAL RESULTS 

Fig. 9 shows the natural oscillation of the uncontrolled disk system. The frequency of oscillation is 
2.11 Hs. with 0.004 damping. (The long-period motion is caused because the disk system is hung from the 
ceiling with a long steel wire to reduce the axial thrust on bearings. This mod* is approximated as a rigid 
body mode in the controller design analysis.) 

Fig. 10 shows the step response of a nonadaptive control system designed with the LQG method. The 
response is very good (Fig. 10) when there is no modeling error in the controller design. However, as Fig. 11 
shows, the system becomes unstable when there b 10% modeling error in frequency in the designing of the 
nonadaptive controller. 
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When the FLL is used in the ndnpthre control, the system can detect and correct a controller’s parameter 
error of 50% or more in frequency. Figs. 12 (a) through (f) show the sensor output in different tests. The 
instability due to the initial parameter error is shown when the control system was just tamed on, and the 
system was then stabilized after the adaptation algorithm had corrected the controller’s error. The initial 
turn on of the control system and the time when position commands are changed are marked on those 
recordings. 

Fig. 13 shows the comparison of the impulsive disturbance response, between the nonadaptive controller 
with no modeling error and the adaptive one after its parameter error has been corrected. The comparison 
shows almost no difference between their performances. 

VI. DISCUSSION 


(A) Frequency-Locked-Loop 

The FI 1/ is a nonlinear element, but its input/ontpnt relation is almost linear. It behaves linearly for 
40% changes in input signal frequency, and still works for 100% change in frequency in the nonlinear region. 
The test recorded in Fig. 4 attests to the discussion above.. The linear range can l chosen by selecting 
parameters properly. 

The FLL still works when the amplitude of its input signal is as weak as two quantization intervals of 
the A/D converter, if it is free of noise and bias; bnt in real applications it must be turned off at small level 
of input signal to reduce the effect of noise. 

The FLL can identify the plant characteristic in a small window of the frequency spectrum, so that the 
effects of other parts of the system dynamics do not have to be taken into account if they ate not critical 
to the overall performance. It can only detect modes that are either only slightly damped or unstable, since 
they can provide oscillatory signals for detection; however, heavily damped modes are usualiy robust to 
parameter uncertainty and don’t need adaptive control. 

(B) Parameter Error Correction Loop 

The parameter error correction scheme can be determined by root-locus analysis, or even by the LQG 
method, since the FLL has a linear characteristic. 

Fig. 12 shows some small-&mplitnde vibration building up doe to the lack of signal to lock the FLL, but 
the parameter estimate error was soon corrected and vibration suppressed. 

By examining the response to command change and to disturbances, it is found that the Sett- Tuning 
Adaptive Controller behaved almost the same as the correct fixed optimal controller, except for the few 
cycles of vibration at the beginning when the parameter error was being corrected. 

It is better to use the error of an estimated sensor output to drive the FLL, since it is undisturbed by 
the control force during a new command change if the model is correct. 

Both the identification and error correction are running in real time while the controller is doing i*s job. 
Any change in the plant can be tracked and adapted to rapidly. 

VD. CONCLUSION 

The use of FLL in identifying system vibration frequency and adapting controller parameters is promis- 
ing. All kinds of controllers, such as Kalman filter and state feedback, band-pass, or notch filters can have 
their parameter errors corrected in a similar way. It is expected that system with many vibration modes can 


149 



be bandied with several FLLs. 
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Fig. 8 Variation of Poles of the Closed Parameter Loop '••nos r, the Changing Slope in Fig. 1. 
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(a) Vibratioa freqneaejr was aasamed to be 1.9 Hs. f>10X error ia freqaeacy) ia Kalman Fflter. 
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• T.O. : initial tamed oa. 

C.C. : step command change. 

Fig. 12 Step Response of the Adaptive Controller with FJLL Detecting Initial Modeling Error in Plant Freqnencjr. 
(a) no error. 

(bi -10% error. 

(c) +25% error. 

(d) -25% ertor. 

(e) -50% error. 

(() +50% error. 
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(*) (b) 



Fig. 13 Comparison of the Impulsive Disturbance Response between (a) the Nonadaptive Controller with No 
Modeling Error and (b) the Adaptive One after Its Parameter Error Has Been Corrected. 
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ADAPTIVE FILTERING FOR LARGE SPACE 
STRUCTURES— A CLOSED-FORM SOLUTION 

H. E. Rauch and D. B. Schaechter 
Lockheed Palo Alto Research Laboratory 
Palo Alto, CA 94304 


ABSTRACT 

In a previous paper Schaechter proposes using an extended Kalman filter to 
estimate adaptively the (slowly varying) frequencies and damping ratios of a 
large space structure. The present paper shovs that the time-varying gains for 
estimating the frequencies and damping ratios can be determined in closed-form 
so it is not necessary to integrate the matrix Riccati equations. After cer- 
tain approximations, the time-varying adaptive gain can be written as the. pro- 
duct of a constant matrix times a matrix derived from the components of the 
estimated state vector. This is an important savings of computer resources and 
allows the adaptive filter to be implemented with approximately the same effort 
as the non-adaptive filter. The success of this new approach for adaptive 
filtering has been demonstrated using synthetic data from a :wo mode system. 

I . INTRODUCTION 

Adaptive estimation and control techniques are being studied for their 
future application to the real-time control of large space structures, where 
uncertain or changing parameters may destabilize standard control system 
designs. In a recent paper Schaechter [l] proposes using an extended Kalman 
filter to estimate adaptively the (slowly varying) frequencies and damping 
ratios of a large space structure. For a system with N states and M (slowly var- 
ying) parameters the extended Kalman filter requires integration of an N+M by N-Ht 
nonlinear matrix Riccati equation to determine the covariance and gain for the 
filter. Schaechter introduces approximations which allow the integration of 
the nonlinear matrix Riccati equation to be replaced by integration of a smaller 
set of linear matrix equations. The N states of the system are estimated using 
constant gains determined off-line. The time-varying gains for estimating the 
(slowly varying) of M parameters are determined on-line by integrating an 
M by N set of linear matrix equations. 

The contribution of the work presented here is to show that the time- 
varying gains for estimating the (slowly varying) frequencies and damping 
ratios can be determined in closed-form so it is not necessary to integrate 
the M by N set of linear matrix equations. This is an important savings of 
computer resources and allows the adaptive filter to be implemented with 
approximately the same effort as the non-adaptive filter. In particular, 
after certain approximations the time-varying adaptive gain can be written 
as the product of a constant matrix times a matrix derived from the components 
of the estimated state vector. The constant matrix is determined off-line 
just as the constant gains for estimating the state are determined off-line. 
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The success of this new approach for adaptive filtering has been demon- 
strated on a computer simulation using synthetic data from a two mode system. 
Work in progress is applying the new approach to a much larger system using 
experimental data. The theoretical development and preliminary experimental 
results are presented in the paper. 


II. FORMULATION WITHOUT ADAPTIVE FILTERING 

The standard state variable formulation of the dynamic equations of motion 
are shown below where the dot indicates derivative, x is the state vector, u is 
the control vector, z is the measurement vector, and w and v are dynamic noise 
and measurement noise. [2] 


x ■ Fx + Gu + T w (1) 

z = Hx + v 

When the dynamic system is precisely known, a state estimator of the following 
form may be constructed where x indicates the estimate of the state x and K is 
the gain matrix. 

x * Fx + Gu +K(z - Hx) A (2) 

The differential equation for the estimation error x > x - x is obtained by 
subtracting Eq. (2) from Eq. (1). 

• 

x * (F - KH)x + Pw ~ Kv (3) 

The differential matrix equation for the covariance of the estimation error P 
follows where R and Q are from the covariance of tha measurement noise v an 
the dynamic noise v. 

P « E (%c T ) 

P - (F - KH)P + P(F - KH) T 

+ rqr T + krk t (4) 

The optimal gain matrix K is chosen to minimize the trace of the estimate 
error covariance to give the usual result 

K » PH 1 ^ 1 (5) 

Notice that for a precisely known dynamic system, the estimation gains may be 
precomputed, even in the event of a time varying system. The analysis used 
with the adaptive filter closely parallels the development without adaptive 
filtering. 


III. ADAPTIVE FORMULATION AND SOLUTION 

Adaptive control may be required when the model in Eq. (1) is unknown, 
uncertain, or dependent upon a changing system configuration. The modifica- 
tions that need to be made in Eq. (1) in order to include the effects of an 
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uncertain parameter are given below where the vector parameter a has a dynamics 
matrix C with dynamic noise w^, 

x ■ F(a)x + Gu + Tw 
a * Ca + w 

a 

z * Hx + v (6) 

As can be seen from Eq. (6), the system dynamics are now a function cf the vec- 
tor parameter a. In this formulation, the vector parameter a represents small 
changes from a nominal value co the average value of a is zero. These param- 
eters are assumed to be slowly varying so that they may be adjoined to the 
state vector. An adaptive state estimator may be written so both the state 
vector and the vector of parameters are updated using the measurements. 


x * F(a)x + Gu 4* K^(2 - Hx) 

a = Ca + K (z - Hx) 
a 


(7) 


Let the symmetric matrices P x and P a represent the covariance of th** *>rror in 
the estimates for x and a, respectively, and let the rectangular matrxx P ax 
represent the cross-covariance of the errors in the estimates of x and a. 

It is necessary to calculate these covariance matrices in order to determine 
the optimal gains l ^ and K a . The optimal gains are selected to minimize the 
trace of the covariance of the estimation error and have the following values. 


T -1 

K * P H X R 1 
x x 


f -I 

K * P H A R 
a 


( 8 ) 


Proceeding as before, and assuming the estimation error a * a - a is small, 
gives the vector differential equation for the error, 


x * (F - K H)x + 
x 


ts-1 


a + Tw - K^v 


(9) 


«-KHx + Ca + « - K v 

a a a 


The Ka^rix differencial equations for the covariance are: 
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where 

and 


dP x /dt - (F-KH) P x + P x (F-K x H) T 
+ Q + K RK T 

x XX 

+ < V > P a « +p .* I < p . i ) 1 

dP ix Mt - cp #x + p^CF-iyo 1 

+ P a (F a x) T - K a HP x 
T 

+ K RK 
a x 

dp /dt - CP + P C T + Q 
a a a a 

- \ W ax - P .x«.W T + *. “a 1 


F = 9F/8a 
a 

R, Q, and Q are covariances of v, Tw, and w 
a * 

(without delta function) . 


( 10 ) 


The remainder of this analysis will show approximations which can be used to 
reduce the computational effort needed to calculate the covariance matrices 
and the optimal gains when the covariance matrix P & is very sm*ll (of 
order t) and the covariance matrix P ax is also very small (of urder e) * The 
gain K a will be very small (of order e) because it is calculated from P ax , 

^he differential equation for the covariance matrix P x will involve somf small 
terms, but most of the terns are larger and constant. If the last two terms 
in the differential equation for P x are neglected (because they are small te:*ms 
of order e) , it is possible to calculate the steady-state constant value of the 
covariance P x . From the constant value of tha covariance P x the constant gain 
can be determined. As one might suspect, the constant gain has „he same 
value as it would have if there were no errors in estimating the parameters 

Because the covariance matrices P ax and P a are of order e, many of the tei 
In the differential equation for P a :?•* of order e squared. If the last 4 
terms in the differential equation for P a are neglected (because they are v 
small terms of order z squared), It is pi .sible to calculate the steady -state 
value of the covariance P fl (to order e). As one might suspect, the constant 
steady-state value obtained for P a is the same value which would have been 
obtained if K a were zero. 

All that remains is to calculate the time-varying covariance r d so .1 at the 
needed gain K a can be determined. Because the gain has been a\ to 

equal P X H T R"^, the last two terma £:• the differential equation foi D ay cancel 
out. For the remaining analysis it will be a turned there are N stale vari- 
ables so the first N/2 variables (designated he N/2 length * ^ctor x*) cor- 
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respond to mode position, an c the last N/2 variables (designated by the N/2 
length vect r x**) correspond to velocity of mode position. The differential 
equatioas c or the dynamics of the mode variables without any forcing or dis- 
turbing terms are presented below where A* corresponds to the damping terms 
(-25w) and A** corresponds to the frequency terras (- 0 )“) . Notice both A* and 
A** are diagonal N/2 by N/2 matrices. 


dx/dt - Fx 


dx*/dt = x** 


(ID 


d::**/dt - A*x*+A**x** 


Let there be N parameters in the vector a and arrange the order of the param- 
eters a so that the first N/2 parameters are the same as the elements of the 
dicgcnal matrix A* and the last N/2 parameters are the same as the elements 
of the diagonal matrix A**. Furthermore, assume the N-by-N symmetric co- 
variance matrix P a associated with these parameters is diagonal and composed 
of diagonal sub-matx* P a * and P a **. With these assumptions, the partial 
derivative can be w ** in a particularly simple way where x* and x** 

represent diagonal ut_ l JlC^S with the diagonal elements equal to the vectors 
x* and x** 


Fx 


F x 
a 


V V 


0 I 


x* ' 

A*A** 

«- J 


X** 



- P *** [x] 

a 


( 12 ) 
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One further assumption is that the dynamics matric C (for the parameters a) is 
diagonal and equal to the scalar Cq times the identity matrix I. With those 
assumptions, the differential equation for the cross covariance P ax can be 
written as fo^ows where x is a diagonal matrix made up of the elements of x. 


dP /dt = P (F-K H+C) T 
ax ax x 


+ (P a ***[x] ) T 


where 


and 


C = 


I 


0 

p *** = 

3 P * 

a 



( 13 ) 


The remainder of the analysis will deal with the cross-covariance matrix P^ 
which is the transpose of the covariance matrix P ax . The differential equa- 
tion for the cross-covariance P xa can be written as follows: 


where 


dP /dt = F* 
xa 


P 

xa 


+ P a***[ ; ] 


F* = F - K H + C 
x 


( 14 ) 


The linear matrix differential equation for P xa has particularly desirable char- 
acteristics. All the terms in the differential equation are known constants 
(because the gain K x and the covariance P a are known and constant) except for 
driving terms due to estimates of the state x. If the approximation is made 
that the derivative of the forcing terms x is equal to the dynamics matrix F 
times x, then, except for transient terms, the solution to the linear matrix 
differential equation for P xa can be written in closed form as a linear com- 
'na^ion of the torcing terms x. This is similar to the result in elementary 
linear differential equations where the general solution is composed of the 
sum of the homogeneous solution due to the unforced differential iquation and 
the particular solution due to the forcing function. 

Because the forcing function [x"| is a diagonal matrix, the first element Is 
the forci \g term for the first column of the solution for the matrix P a ^, the 
second element X£ is the forcing term for the second column of the matrix P ax , 
and so on. Let Pi be a vector which represents the i-th column of the matrix 
P^x* T^ e l insar matrix-vector differential equation for the i-th column can 
be written as follows where P a i»is a scalar which Is the i-th element of the 
diagonal matrix P a and x^ ’s a scalar which is the i-th element of x and P| is 
is the i-th col^nn of the matrix P a *** which is all zeroes except for entries 
equal to the diagonal elements of P a 
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(15) 


dP./dt = F* P, 
1 i 


+ P.* x. 
1 i 


The solution for the vector P^ is assumed to be composed of the sum of two 
vectors. The first vector is the constant vector times the scalar xj 
(corresponding to the estimate of the position of the mode) and the second 
vector is the constant vector times the scalar x^ (corresponding to the 
estimate of the velocity of the appropriate mode). 


P. = E. x. + G.x. (16) 

1 1 J 1 K 

where for i <N/2 then j = i and k = i + N/2 
for i >N/2 then j = i-N/2 and K * i 

The derivative of the vector P^ can be calculated directly if it assumed the 
derivative of the vector x is equal to F x with A* and A? being scalars which 
represent the j-th element of the representative diagonal matrices which make 
up F. 


dP^/dt = E^dx^/dt + G^dx^/dt 


= + G^A*Xj + Gj Aj **Xj [ 


07) 


Substituting the expression for the assumed form of the vector P^ and the 
expression for the derivative of the vector P^ into the differential equation, 
gives the following equations where 6^ is a discrete delta function which is 
unity if i equals j and zero otherwise. 


G i A j**j + <E i + G i A J** ) *k 


U8) 

* F * Vj + p * G i*k * S lj P j *- j + ! ik P k* *k 

Collecting all terms which multiply the scalar x. gives one. veccor equation 
and collecting all terms which multiply the scalar x k gives a second vector 
equation. The j are two vector equations and two unknown vectors and Gi. 


G iV ■ F * E i " Vi* 

E 4 + G lA) ** - F* Gj + 6 lk P k * 


(19) 
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The expression for E^ obtained fiom the second equation is substituted into the 
first equations to give a single equation with the unknown vector Gj 


G.A.* = F* (F*G , - G.A.** + 6,, P, *) + 5..P.* 
1 ] i i 3 ik k i] j 


( 20 ) 


Since nj* and Aj** are both scalars, it is possible to solve directly for the 
unknown vector G^ where I is the identity. 

G. = (IA.*+F*A.** - F*F*) _1 (6 . .P . *+6 .. F*P *) (21) 

l J J ij j ik k 

In the same way, the expression for obtained from the first equation is sub- 
stituted into the second equation to give a single equation with the unknown 
vector E-^. 


A . *E. = (F*-IA. **) (F*E. + . . P *) + A.*6..P * (22) 

J i J i ij j jikk 

It is again possible to solve directly for E^. 

E x - (I*.. + F* Aj .. - F*F*) -1 [ 6 ij (F *- IA J ** )P j* +S lk A j *V ] < 2 « 

Thus the two unknown vector quantities and G± can be determined from known 

quantities so the covariance vector F± and the approximation for the covariance 

matrix P can be determined, 
ax 
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IV. SIMULATION RESULTS WITH TWO MODES 


The new, simplified adaptive formulation was first tested with a single 
mode system. After encouraging results were obtained with one mode, a two-mode 
system was examined. The two-mode system used in the simulation studies is 
shown in Figure 1. 



FIG. I TWO-MODE SYSTEM 


The system consists of two masses, M, three springs, K, and three viscous 
dampers, B. For this study, M-l, K=l, and B»0.10. Control forces may be 
applied to both masses, random external forces disturb both masses, and noisy 
measurements of the position of both masses are available. The measurements 
are used for estimating the state vector, and for estimating the parameter 
vector. The differential equations representing this system are: 


Mx^ + 2Bx^ - BX 2 + 2Kx^ - “ f^ + 


Mx^ + 2 Bx 2 - Bx^ + 2 KX 2 - “ ^2 + W 2 


z i = x i + v i 
z 2 * x 2 + v 2 


(24) 


The natural frequencies and damping ratios of this system are: 

* 1 Cj * 0.05 

b>2 - 1.732 ? 2 - 0.0869 

where the low frequency mode is the common mode motion of the two masses. The 
spectral densities of both the process and measurement disturbances (Q and R) 
are 0.0163. Two hundred position measurements of both the masses were made 
during a sixty second computer simulation. This sixty second duration was 
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selected to assure about ten oscillations of the lowest frequency mode. The 
sample rate was selected to give about ten samples per cycle of the highest 
frequency mode. The value of the correlation time constant for the parameters 
that are to be estimated was 250 seconds (so C Q is 1/250). This value is much 
larger than the time constants of the system. The selection of a "large" 
value is important in order to allow the adaptive filter to average values 
over several cycles of the system. The fol’ wing table gives a summary of 
the test cases. In each case, both the standard, non-linear extended Kalman 
filter, and the simplified extended Kalman filter described in this paper 
were run in older to make comparisons. In all of the cases studied thus 
far, these two cases were indistinguishable, except for a small, initial 
transient. This transient effect is attributed to beginning the standard 
extended Kalman filter covariance integration with values slightly different 
from the steady state values. 


CASE 1 

u) ^ unknown 

known 

no noise 


unknown 

5 2 known 


CASE 2 

unknown 

unknown 

no noise 

— . — «| 

unknown 

£ 2 unknown 


CASE 3 

ojj unknown 

unknown 

noise 


a), ur.xnown 

4. 

t 2 unknown 



The results are shown i the following figures and are discussed below. 
In Case I the starting estimates for the natural frequencies were chosen 
to be 10% in error with estimated to be 0.9 (rather than 1.0, and u 2 
estimated to be 1.559 rather than 1.732). The damping parameters were 
exact, and no noise was present in the system. The results for the estimate 
of w l (Fig. 2) show that the modal frequency is very readily identified from 
the measurements, inspite of the 10% initial error in the estimate. As the 
system response diminishes, less information is available for updating the 
parameters. Consequently, with no new information coming into the system, 
the parameter estimate begins to return to its nominal value (0.9) with the 
selected time constant of 250 sec. The estimate of “2 behaves similarly. 

In Case II, the objective was the same as in Case I with the additional 
problem of simultaneously estimating the damping parameters. The initial 
estimates of the damping parameters were zero. The results of the poor ini- 
tial guess of the damping parameter are evident in Fig. 3. The estimate 
of the modal frequency tends to be lightly damped, but in all other aspects, 
the estimate of w i appears to have the same features that were present in 
Case I. As has been found in past studies fa the estimate of the dampii $ 
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parameter itself is quite poor. This is due to the fact that the position 
measuxement contains very little damping information. 

Case III is identical to Case II with the addition of both process and 
measurement disturbances. Surprisingly, this case yielded the best results, 
as is evident in Figures 4 and 5. The effects of the noise are clearly 
visible in the figures. However, in contrast with the previous two cases, 
the process noise continues to excite the system after the transient effect 
of the initial conditions have subsided. The result is that the measurements 
continue to provide information on the parameters for the duration of the 
simulation. Since the higher frequency mode is more heavily damped, and is 
less perturbed by the external disturbance, the improvement in the natural 
frequency estimate of mode two is not as dramatic. 

CONCLUSIONS 

This paper has developed approximations which allow dramatic reductions 
in the on-line computational requirements of the extended Kalman filter. 
Numerical simulations of this technique have validated the approach for two 
simple spring-mass systems. It was found that the full non-linear extended 
Kalman filter and the closed-form adaptive filter developed in this paper gave 
virtually identical results. Work is currently in progress to apply this 
approach to a much larger system using experimental data, rather than simulated 
data. 
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ROBUST ADAPTIVE CONTROL 
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ABSTRACT 

The paper discusses several concepts and results in robust adaptive control 
and is organized in three parts. The first pa^ . 'nr\eys existing algorithms. 
Different formulations of the problem and theoretical solutions that have been 
suggested are reviewed here. The second part contains new results related to the 
role of persistent excitation in robust adaptive systems and the use of hybrid 
control to improve robustness. In the third part promising new areas for future 
research are suggested which combine different approaches currently known. 

1. INTRODUCTION 

The stable adaptive control of linear time invariant plants, in what is now 
termed "the ideal case", was resolved in 1980 [1-4], The assumptions made in 
[1-4] regarding the plant to prove global stability are quite stringent. Specifi- 
cally, knowledge of the sign of the high frequency gain K , the relative degree 
n* and an upper bound n on the order of the plant transfe? function are assumed 
to be known. Further it is assumed that the zeros of the plant transfer function 
lie in the left half plane, the plant parameters are constant (though unknown) and 
the system is disturbance free. However, in practice, these assumptions are rare- 
ly met. No actual plant is truly linear, finite dimensional or noise free. Fur- 
ther, in practical situations, the rationale for using adaptive control is to com- 
pensate for large variations in plant parameters. In the presence of such devia- 
tions from ideal conditions, the algorithms suggested In [1-4] no longer assure 
the boundedness of the signals in the adaptive loop. This accounts for the wide 
interest in the past few years in what is termed robust adaptive control to 
achieve satisfactory performance in the prese ce of both mrdeling and operating 
uncertainties. This paper attempts to survey some of the modest gains that have 
been made in this direction, presents some new results for improving robustness 
and discusses promising directions for future research. 

Adaptive systems are special classes of nonlinear systems and many questions 
which arise in such systems can be stated as problems in the stability theory of 
differential equations. In particular, questions of robustness can be addressed 
using amply discussed results cn practical stability and total stability. Since 
such results are bound to find increased application in adaptive systems, some of 
the more frequently used concepts, definitions and theorems are collected in 
section 2. 

Recent years have witnessed many contributions to the robustness problem. 
Among these some assume additional prior information regarding the ur ^rtainties 
to suitably modify the adaptive algorithms [5-9 ] while others assr that the 
reference Inputs possess properties which make the ideal system e k onentially 
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stable. In all cases it is shovm that boundedness of solutions is assured when 
the true situation deviates in specific ways from the ideal. Some of these ana- 
lytical results which are currently known are presented in section 3. 

Sections 4 and 5 contain some new results on persistent excitation and hy- 
brid auaptive control which are relevant to the problem of robustness. In sec- 
tion 4 a nonlinear error equation of second order is discussed in detail. While 
the ideal system is uniformly asymptotically stable it is shown that unbounded 
solutions can result if the disturbance is sufficiently large. It is also shown 
that by increasing the degree of persistent excitation ot the reference input the 
overall system can be made practically stable. Section 5 discusses hybrid con- 
trol algorithms recently introduced by the authors [10]. The same algorithms can 
also be modified to adaptively control discretj plants by updating control param- 
eters infrequently. Some plausible arguments are given towards the end of the 
section as to why such algorithms May be more robust than continuous algorithms 
when external bounded disturbances are present. 

Finally, in section 6, possible vays of combining known methods are discussed 
in the hope that it will stimulate research in these new directions. While no 
hard results exist in these areas the suggestions are based on extensive simula- 
t Lon studies. 


2. MATHEMATICAL PRELIMINARIES AND STABILITY RESULTS 

Some well known concepts and results of stability theory which find frequent 
application in the analysis of adaptive systems are included in this section. 

While they can be readily found in any good text [11-13] we present them here for 
easy reference as well as to place some of the problems discussed in the follow- 
ing sections in proper perspective. We start with the definitions of uniform 
asymptotic and exponential stability of the solution x = 0 of an equation 
i = f(x,t), f(0,t) = 0. We assume that f is continuous and satxsfies conditions 
which guarantee the existence and uniqueness of solutions and continuity of their 
dependence on the initial conditions. The general solution of the dif ferential 
equation is denoted as p(t,x 0> t Q ) with * x o # 

(i) Definition (Uniform Asymptotic Stability): The equilibrium x ® 0 of the 

differential equation x * f(x,t) is uniformly asymptotically stable if it is uni- 
formly stable and for some > 0 and all e 9 > 0 there is a > 0 such that 

|Jx 0 II < Gj implies II p(t,x 0> t 0 ) II < e 2 for all t z t n -1 T. 

(ii) Definition (Exponential Stability): The equilibrium state of the equation 

x « f(x,t) is exponentially stable if two positive constants a and 8 which are 
independent of the initial values. exist such that for sut Lently small initial 
values, ||p(t,x 0 ,t 0 ) || < B|Jx Q ||e"° “ C 0\ 

A linear time-invariant system with f(x,t) * Ax where A is a constant matrix 
is asymptotically stable if the eigenvalue, of A are in the open left Y .If of the 
complex plane. Asymptotic, uniform asymptotic and exponential stability are 
equivalent in this case. For linear time-varying oystems f asymptotic stability 
does not imply uniform asymptotic stability whereas the latter is equivalent to 
exponential stability. For linear systems, all stability properties hold in the 
large. In general, for nonlinear systems exponential stability implies uniform 
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asymptotic stability but not vice versa. If f(x,t) is autonomous or periodic in 
t, all stability properties are uniform. 

In robust adaptive control we are interested in deducing the propertied of 
the solutions of a perturbed system (S ) from the behavior of the solutions of an 
unperturbed system (S) . These are described by the differential equations 

'' ■ f(x,t) (S) ; x * f(x,t) + g(x,t) (S p ) (1) 

Let the equilibrium state of (S) be exponentially stable. If ||g(x,t)|| < L |] x |l 
for sufficiently small b and 6,and|l x |j< 6,then the equilibrium star'* of (S ) is 
also exponentially stable [11]. In physical situations the condition g(0,t? * 0 
required above is not generally met and this gives rise to the concept of total 
stability. 

(iii) Definition (Total Stability) [?1]: The equilibrium state x *■ 0 of (S) is 
totally stable if for every e > 0 two positive numbers 6^(e) and 6^(0) exi3t such 

that every solution p(t,x ,t.) of (S 1 satisfies It p(t,x n ,t-) II < e> t % t. provided 
|* 0 |< .nd*g(x,t) |<°S 2 ? p I' 0 0 0 

In the Russian literature this is also referred to as stability under per- 
sistent disturbances. The uniform asymptotic stability of the >erturbed system 
implies total stability [11] and is frequently used to prove robustness of adap- 
tive systems in the presence of sufficiently small perturbations. Recently the 
magnitude constraint on ||x Q || in the definition of total stability has been relaxed 
by Anderson and Johnstone [8] at the expense of stronger conditions on f(x,t). 


In practical systems we are interested in the uniform boundedness of the solu- 
tions in the presence of perturbations as well as in the magnitudes of this bound. 
This leads to the concept of practical stability defined below. 

(iv) Definition (Practical Stability) [12]: Let * (x|||xH< <5^1 bo open 

set in |R n and 6^ > 0 a constant suph that || g(x r t) |) < 6^ ^ or x an< * c 

If the solutions of (S^) lie within a closed bounded set Q OQq f° r x q e Qq then 
the system (S) is said to be practically stable. 


Total stability assures the existence of and 6^ relative to which the sys- 
tem (S) is oractically stable but provides no way of estimating the size of or 

the magnitude of 5^. In adaptive control applications this is not adequate. One 

is more interested in determining an estimate of Q from a knowledge of 6^* 

An alternative method for treating the effect of pertur tions is by consid- 
ering them as bounded independent functions of time. This leads to the w» 11 known 

concept of bounded input - bounded output (BIBO) stability. 

(v) Definition (BIBO Stability): A system x « f(x,u,t) with £(%0*t) * 0 is 

BIBO stable if for every a s 0 and every a £ 0 there is a 8 * B(a,a) such that 
JI P u <T> VVII - 8 for ail t J tg for every initial condition (x^.t^) with 

*| <- a and 8£p||u(t) |jf a, where P u (V^ 0 »t 0 ) is the solution of the system with 
ir u(*). 
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A linear system x = A(t)x + b(t)u is L' 60 stable if the homogeneous part is 
uniformly asymptotically stable. This is a property which is frequently used ic 
robust adaptive control using the concept of persistent excitation. In contrast 
to the above, uniform asymptotic stability of a nonlinear system does not imply 
BIBO stabilitv. An example of this was given by Deso^r et al [14]. A similar 
situation arises in the discussion of robustness of a second order nonlinear sys- 
tem in section 4. 

Stability Problems in Adaptive Syste j: The study of the stability or adaptive 
systems (as sh^wn in 'the following sections) can be conveniently carried out using 
a set of nonlinear time-varying error differential equations. Even in the 
or disturbance free case the time-variations arise due to the presence of the 
reference input r(«). The following are some noteworthy features of many of the 
stability questions which arise in adapt e systems. 

(i) In the Jdeal case, a Lyapunov function V > 0 with v ^ 0 can be found. The 
negative sem? -definiteness of cannot he avoided and is a resuU. cr the adap*:iv^ 
law used. 


(ii) a result of (i) even the unforced (autonomous) ^ tta is uniformly sta- 
ble. Even when tne reference input is persistently exciting, $ 0 but the system 
can be shown to be uniformly asymptotically stable [15]. We note that LaSalle's 
theorem cannot be directly applied to prove this since the system is nonautonomous. 

(iii) Since the system is exponentially sta. le with a persistently exciting ref- 
erence input, Malkin's theorem can be used to conclude that the solutions will be 
bounded for some initial set and perturbation of magnitude $ 6£. However, very 

little can be said directly about either or 

(iv) Another cc .sequence of the semidefiniteness of V is that assuring even the 
boundedness of solutions using Lyapunov's Direct method for given bounus on per- 
turbations is no longer trivial. Some of these cases are considered in section 3. 
In section 4 it is shown that even v 1 m the reference inpuv is persistently excit- 
ing, if the disturbance is i->cge thfc solutions can be unbounded. Alternately, for 
a given bound on the disturbance the persistent excitation can te made sufficient- 
ly large to assure the boundedness of the solutions. 

3. RECENT RESULTS IN ADAPTIVE CONTROL 


In this sect!* n we attempt to survey briefly some o*. the theoretical results 
currently known ir the area of robust adaptive control. The aim of the section la 
to provide an understanding of the qualitative idea*, that led to these result* a * 
well as the analy ical tools used in deriving them. Since the ideal system lorms 
the starting point of all perturbation a.i^lyses, we shall .brief ly outline the 
statement of the problem and the proof of stability in chis case. Further, while 
several stable adaptive algorisms have been suggested in the literature, we shall 
discuss the proof of stability using only a algorithm proposed in [16]. The 
proofs using all the other algorithms follow along similar lines. 


a) Ideal System ; The plant to be controlled is described by the ?"ate equations 


Ax + b u 
P P P 


r r 

y * c 
p p p 


( 2 ) 
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and a reference menial is described by 


• T 

x = A x + b r ; y » c x 
m mm m m mm 


( 3 ) 


where k , u and y are respectively the state input and output of the plant and 

x an« ? y_ are P the state anu output of the model. The transfer functions of 
m m * 

the plant and model are 


K 2 (s) 

W (s) * c (sI-A ) b = r 

P p p p n ,8) 

The following assumptions ai r ide regarding W (s) and W (s) 

p m 


T -1 

k (8) * c (sI-A ) b » 
m m a m 


*m 


R (s) 

m 


(i) Z (s), R (s) and t< *) are moric polynomi Is of degrees m, n and n*=n-m 
P P * 

(ii) Z (s) and R (s) ai*. . tric^ly seable polynomials 
p m 


and (iii) r is a piecewise continuous uniformly bounded reference input. 


The objective is to co^tr^l rhe plane in such a fashion that the output error 
between plant and model e^ * tonds to zero asymptotically, while the sig^ 

nals and parameters of the system remain uniformly bounded. It is now well known 
that knowledge of the exact relative degree n* of the plant, an upper bound n on 
its order, the sign of the gain and the condition that Zp(s) be Hurvitz as 

given in v ii) are needed to solve *he problem, n* enables the model to be con- 
structed while the value of n determines the order of the controller to be used. 
The sign of K and the constraint on Z (s) are needed to prove the stability of 
the overall system. ^ 

Structure of Controller : In the following we shall assume that K is known and 

K * K * 1. To meet the control objective a controller describeB by the follow- 
p m 

ing equations is used: 

; (1 > = Fw (1) + gu ; J> (2) - Fu/ 2 ^ + gy p ; U - e T w + r (4) 

where F is an asymptotically stable nxn matrix, (F,g) is controllable, 

T T 

T (1) (2) 

id * [id ,tu ] and 0(t) is a 2n dimensional parameter vector which is to be 
adjusted adaptively. It Is well known [17] that a unique constant vector 0* 
exists such that the transfer function of the plant together with the controller 
matches that of the model exactly, when 0(t) = 0*. The aim of the adaptive law 
is to adjust 9(t) in such a meaner that the overall system is globally stable and 
lim e (t) ■ 0. 
t-*» 1 

While several special cases of the adaptive control problem have been con- 
sidered, we discuss below the general case when W^(s) has a relative degree 

n* > 2. If 0(t) - 0* « $(t) , then <p is the parameter error vector and the output 
of the plant can be expressed as 

y (t) =* W (s)[r(t) + <j> T (t)u)(t)] (5) 

p m 
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The Adaptive Scheme ; To generate the adaptive law an auxiliary error signal y (t) 
is added to e^(t) to generate an augmented error e^(t). If & 


y (t) = ie T (t)w ( S )i - w (s)e T (t)]u>(t) 

a mm 


( 6 ) 


then 


4> T (t)C(t) = e^t) + y fl (t) = e^t) (7) 

where W (s)lu = £• The adaptive law for updating 0(t) then depends on the aug- 
mented error e (t) and the signal £(t) and is given by 

- e |(t)C(t) 

♦ (t) = 9(t) ^ (8) 

1 + C (t)C(t) 

This has been shown to result in global stability of the adaptive loop [16]. 

T 

Proof of Global Stability : If V($) ** 1/2 $ the adaptive law (8) yields 

V(+) - i 

1 + C (t)c(t) 

from which it follows that 

(i) <f> and $ are uniformly bounded 

(ii) ♦ e L 2 (9) 

and (iii) El (t) - v(t)tl + c T (t)c(t)] 1/2 . v e L 2 

Since the complete proof is too long and involved to be included here in its 
entirety we merely outline the principal steps involved. 

(a) Since the parameter vector is bounded by (i) it is first shown that 

supjy (x) j ^ sup ||u/ 2 ^(t)|{ * sup ||i*>(t)|| ^ sup lU(x)B (io) 

Tit P Tit Tit Tit 

Here 'v is an equivalence relation and Implies that the corresponding signals in 
(10) grow at the same rate [18]. 

. 2 

(b) Since ^ c L it can be shown that y (t) grows at a slower rate than 

a 

sup |u(t)Q denoted by y (t) * o[sup (u>(t) g ]. (11) 

tit a Tit 

(c) From (5) , (9-iii) and (11) it follows that 

e - W <|> T w * v[l + C 1 1 ^ 2 + o[sup ||<d(t)|| ] (12) 

1 “ 2 Tit 

(d) Since v e L using equation (4) we conclude that 
sup|| u/ 2 ^ (t) || ■ o sup || ci>(t) || which contradicts (10). 

Tit Tit 

Hence all the signals in the system are uniformly bounded and lim e^(t) * 0. 

t-HJO 
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The importance of demonstrating the boundedness of $(t) and ^ e [ in the 
proof of stability is worth noting. [In some cases it may be possible to show 
that lim $(t) = 0, which serves the same purpose.] The former assures that the 

t-*» 

relevant signals in (10) grow at the same rate while the latter is used to prove 
that |y (t)| and Jj || should grow at different rates if the adaptive control 

is used, leading to a contradiction. 

Asymptotic Stability of the Ideal System ; Once the boundedness of all the signals 
in the adaptive system has been established, interest shifts to the convergence 
of the parc'seter vector 0(t) to its desired value 0* or equivalently of <fr(t) to 
the null vector. Since the adaptive law (8) can be represented as 

i(t) - - (t) — *(t> d3) 

1 + C (t)c(t) 

the conditions that have to be imposed on c(t)'to accomplish this is of interest. 
Following the results of Morgan and Narendra [19] if £(t) is persistent- 

/1 + c T (t)c(t) 

ly exciting lim <fr(t) * 0 and the convergence is exponential. Since W (s)Ioj * 5 , 

t-H» m 

a sufficient condition for £(•) to be persistently exciting is that w(*) is per- 
sistently exciting [15]. Hence conditions under which <*)(•) will be persistently 
exciting have been investigated by several authors [15,20-22]. 

Persistent Excitation (PE) of qj(t) and u)*(t) : Early results on the convergence of 

the parameter vector to the null vector were stated in terms of the PE of co(t). 
However since ui(t) is a dependent variable within the adaptive loop, very little 
can be said directly about its persistent excitation. Hence attempts were made to 
express this condition in terms of the PE of signals in the model which are at the 
discretion of the designer. Since the adaptive system and model transfer functions 
are identical when 0(t) = 0*, the model can be parametrized in such a fashion that 
a signal u* in it would correspond to the signal u>(t) in the adaptive loop. Fur- 
ther since the model is time Invariant, conditions on r(t) which would assure the 
PF of w*(t) can be derived* If w(t) ■ u>(t) - to*(t), the adaptive law assures that 
lim w(t) = 0. Hence, in the ideal case the PE of w*(t) ensures the PE of w(t) 

t-H» 

and hence the convergence of the parameter vector 0(t) to its true value. 

Comments : 

(i) The above arguments have focussed attention on several interesting ques- 
tions related to persistent excitation and transformations -under which the proper- 
ty is preserved [15]. 

(ii) The convergence of «(t) to 0 is used above to show the PE of co(t) and hence 
the convergence of $(t) to 0. This is no longer possible when an external dis- 
turbance is present since even the boundedness of the signals is not assured in 
such a case. 

(iii) From the results of several authors it is now known that an almost periodic 
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reference input with n-distinct frequencies results in the PE of u)(t). 

b) Adaptation Under Perturbations : The adaptive control system described in sec- 

tion (3a) assumed ideal conditions* The plant was linear and time-invariant and 
no external disturbances were present* In addition, considerable prior knowledge 
of plant transfer characteristics was assumed to help in setting up a reference 
model and deriving stable adaptive laws. As mentioned earlier, plants are rarely 
strictly linear or finite dimensional and in many practical situations the need 
for adaptive control arises due to large parameter variations* Also, external in- 
put and output disturbances are invariably present in real systems* Hence there 
is a definite need to extend the theory developed for the ideal case to situations 
with modeling errors and external disturbances* Some of the schemes that have 
been proposed in recent years to achieve robustness in the presence of such per- 
turbations are briefly reviewed in this section and some new results are reported 
in sections 4 and 5* 

The basic adaptive system in the ideal case is only uniformly stable* This 
implies that bounded perturbations can theoretically produce unbounded outputs* 
When the reference input is persistently exciting, the nonlinear system is uni- 
formly asymptotically stable in the large and exponentially stable when the ini- 
tial state Xq lies in a finite ball around the origin* The latter fact allows 
BIBO results to be derived using theorems cf the type described by Malkin, pro- 
vided the perturbations are sufficiently small* However, as pointed out in sec- 
tion 2, very little can be said using such an approach about the effect of bounded 
perturbations of a specified maximum amplitude on the global behavior of the solu- 
tions of the adaptive system* In addition to such perturbation methods a few 
global methods have also been used to derive results in robust adaptive systems* 
The principal concepts involved in deriving some of these are discussed below* 

(i) Use of Dead-Zone {5] : The problem statement is similar to that given for the 

ideal system with the exception that y * c ^x + v, where v, is a bounded dis- 
7 P P P 1 1 

turbance. Using the same adaptive law (8) as in the ideal case, the error equa- 
tions can be expressed as 

♦ T (t)c(t) + v(t) - e(t) (14) 

and 

$(t) * (adaptive law) 

1 + t (t)Tt(t) 

where v is an equivalent output disturbance due to v- . The difficulty arises due 

T 1 T 

to the presence of v(t) in (14). When sgn[$ £ + v] * sgn[4> £] the adaptation is 
in the right direction. Otherwise the parameter vector may be adjusted away from 
its desired value. This implies that problems of convergence may arise when e(t) 
is of the order of the vound V- of v(t). The modification in the algorithm sug- 
gested in [5] is to use a dead-zone so that the adaptive parameters are not ad- 
justed when e(t) lies inside it* Hence the overall system operates in two modes — 
a linear time- invariant mode when |e(t)| £ v 0 + ® (for some constant 6 > 0) and 
an adaptive mode otherwise. In [5] It is shown that such an algorithm results in 
a system with bounded signals* Further, adaptation takes place for only a finite 
time* This implies that in practice the system will converge to a linear time- 
invariant system in a finite time after which the output error will lie entirely 
in the dead-zone and hence adaptation ceases entirely. 
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(ii) Bound on H 6* jj : An alternate approach to the bounded disturbance problem 

was taken by Kreisselmeier and Narendra [6], While the statement of the problem 
as well as the structure of the controller are identical to that in (i), it is 
assumed that no knowledge of a bound on the disturbance is available. Instead, 
it is assumed that the desired vector 0* has a norm less than a specified value 
M 0 *Hma X * Hence the search procedure can be confined essentially to the set 

S: < 6 ! U°U ‘I 0* Hmax^* 1116 ada P tive law used t0 update 6(t) is identical to that 

in the ideal case when 0 lies in the interior of S and is modified when it reaches 
the boundary of S, or lies outside it. In [6] it is shown that such a scheme re- 
sults in the boundedness of all signals in the system. 

Apart from the obvious differences between the schemes suggested in [5] and 
[6], there are mathematical differences in the proof that are worth stressing. 

As in [1-4], the proofs of stability in [5] use limiting arguments as t + * to 
show that $ e Such a procedure cannot be used in [6], since $(t) does not 

tend to any limit as t -*• «. Hence all arguments are based on the analysis of the 
behavior of the systea over a finite interval. As shown in section 6 the approach- 
es in (5] and [6] complement each other and can be .. ombined to have wider applica- 
tion in adaptive systems in the future. 

(iii) The o-modif ication Scheme ; In approaches (i) and (ii) certain prior infor- 
mation is assumed to implement the adaptive laws. In contrast to this, a scheme 
suggested by Ioannou and Kokotovic [7] assures boundedness of all signals in the 
system, without any assumptions regarding the bounds on either the disturbance or 
the control parameters. However, to the authors' knowledge, the method has been 
shown to result in global boundedness only for the special case when the refer- 
ence model is strictly positive real. 

The method is based on the following simple ideas. If V(e,$) is a quadratic 
Lyapunov function candidate, the time derivative $(e,$), along a trajectory, is 
generally a quadratic function of e and hence is negative sem^def inite . When^a 
disturbance is r resent, V(e,$) has the general form -e L Qe + e av, where Q * Q > 0, 
a is a constant vector and v is a bounded disturbance. Very little can be con- 
cl*. led regarding stability from this and accounts for the modifications suggested 
in [5] and [6]. In [7], an additional term -o0 is used in the adaptive law, as a 
result of which V(e,$) becomes negative definite outside a bounded region in the 
(e,<(>) space. From this it is concluded that all signals in the system are bound- 
ed. 

(iv) Adaptive Systems and Time-Varying Plants : The methods outlined in sections 

3b(i-iii) deal with the global behavior of the adaptive systems when bounded per- 
turbations are present. In contrast to this Anderson and Johnstone [ 81 examine 
adaptive control problems where the assumptions made regarding the system deviate 
slightly from the ideal. While [ 8] addresses primarily the problem of time- 
varying plant parameters the authors claim that the same methods with remarkably 
little change allow examination of the effect of measurement noise, plant nonlin- 
earity and undermodelling of the plant order. 

As in our discussions in section (3a), the authors first consider the ideal 
system and demonstrate uniform or exponential stability in the presence of per- 
sistent excitation. For the various types of perturbations considered, their aim 
is then to show that the resulting equations can be cast in such a form that the 
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total stability of the overall system can be demonstrated using modifications of 
Malkin s theorems* however, as mentioned earlier, the theorems are useful primar- 
ily for establishing *he existence of robustness in the presence of sufficiently 
small perturbations rather than for providing guidance in the choice of the con- 
trol input to assure boundedness of solutions when the class of perturbations is 
specified. 


4. PERSISTENT EXCITATION AND ROBUSTNESS 


In the last section, we discussed two approaches of studying the robustness 
problem in adaptive systems* The approach in 3-d assumed that the perturbations 
were sufficiently small and derived BIBO results local in nature, using Malkin's 
theorem, whereas in 3a-3c the approach was global in nature and used additional 
information regarding plant dynamics and the external perturbations* Also, the 
first approach made use of the PE of the reference input which was not needed in 
the second* 

In this section, we present some new results which demonstrate global bound- 
edness of all signals in the adaptive system in the presence of bounded disturb- 
ances when the reference input is sufficiently persistently exciting. We show 
that by analyzing a set of nonlinear error differential equations, we can estab- 
lish the global robustness behavior of the adaptive systems* In particular, it is 
shown that if the persistent excitation of the model output is larger than the 
disturbance, the solutions will be globally bounded and that if the maximum ampli- 
tude of the disturbance is greater than that of the model output, the system can 
have unbounded solutions* The basic idea is stated here by considering the adap- 
tive control of a first order plant and studying the corresponding second order 
nonlinear differential equations in detail* The same methodology is 
applicable to the general adaptive control problem* 


Nonlinear Error Equations : The plant to be adaptively controlled, the correspond- 

ing reference model and the resulting error equations are as follows: 


Plant: 

Model : 

Error Equations: 
Adaptive Law: 


m 


ay + u + v: u 

p p 

-y + r 


0y + r 
P 


e l " ' e l + % + v 

♦ " -Vp 


(15) 


where r is the reference input, v is a bounded input disturbance, e^ is the output 

error defined as e, * y - y and 6 is the parameter error. In the ideal case, 

1 ^p 'm T 

when v(t) = 0, by considering 


e^t) - -e^t) + $(t)y p (t) 
$(t) * -e 1 (t)y p (t) 


(16) 


it immediately follows that the system is uniformly stable and if y (t) is per- 
sistently exciting, the system is exponentially stable. When a ^ disturbance 
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v(t) is present, it is tempting to proceed as in the ideal case and require y (t) 
in (15) to be persistently exciting so that the unperturbed system is expo- ** 
nentially stable resulting in a bounded error vector for bounded perturbations. 
Since stability of the overall system has not been established, y (t) cannot be 
assumed to be bounded and proving that it is PE becomes specious,* 5 Hence we have 
to express the right hand side of (15) in terms of the model output y (t) which 
is an independent variable rather than the dependent variable y (t), ihis results 
in the following nonlinear error differential equations: ** 

e (t) = -e (t) + <j>(t)y (t) + *(t)e (t) + v 

1 2 U7) 

$(t) = -e (t)y (t) - e (t) 
l m 1 

By analyzing the above nonlinear differential equation, we demonstrate the global 
behavior of the adaptive system in the presence of v(t). 


The Idea] System : In the absence of external- perturbations, the nonlinear system 

(18) 


e (t) = -e (t) + <Kt)y (t) + $(t)e (t) 
it m l 


<J>(t) = -e. (t)y (t) - e (t) 

1 m 1 

can be shown to be uniformly asymptotically stable in the large as follows: If 

12 2 2 
W(e^,$) « y [e^ + <(> ] , the time-derivative W[e^,<J>] * -e^ < 0, Hence the system 

e,(t) and <f>(t) are uniformly bounded for all t > t n , if W[e- (t n ) *4>(t n ) ] < 00 . 

1 2 * -UlUO 21 

Since e^ e L and e_ is bounded, litn e (t) * 0. The nonlinear vector [<(>e ,-e^ ] 

t~H» 

can be considered to be the input to the linear part which is exponentially stable 
if y m (t) is PE, As e^ 0 as t -► ®, this input tends to zero and hence x(t) + 0 

as t + ® where x = [e^,4>]^. Since all the arguments ere independent of the init- 
ial time tg and the magnitude of the initial conditions, the system is u,a.s,l. 

It is also worth noting that when y^(t) is PE, the linear part of (18) is expo- 
nentially stable but the nonlinear system is exponentially stable only when the 
initial state x(t n ) lies in a finite ball around the origin and not globally ex- 
ponentially stable. 


Perturbed System : To provide some insight into the behavior of the nonlinear sys- 

tem, we shall discuss three cases where the perturbed nonlinear system (17) is 
autonomous. 


Case (i) y (t) = 0: When v(t) = 0, th^ system is uniformly stable. If v(t) = v , 

m max 

lim <j>(t) =* -oo and lim e (t) * 0. 

t-Kr t-HX> 

Case (ii) y (t) = y : The unforced system in this case is autonomous and, by 

m max 2 

LaSalle’s theorem, is u.a.s.l. since the largest invariant set in E * {x|e, « 0} 

is the origin. However, since the system Is nonlinear, it no longer follows that 
a bounded input will result in a bounded output. If, for example, v(t) = -v , 
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where v > y , we can show that lim e„ (t) = -y and lim d>(t) = 
max max ^ l v 7 7 max T 

t-x» t-*» 

Case (iii) y (t) = y ,y > v : The system is Lagrange stable. When 

m 7 max max max J v & 

v(t) = -v , the system has an equilibrium state at (0, —2^- ) which is 
max v 

max 

u.a.s.l. Similarly when v(t) = v max » the system has an equilibrium state at 

( 0 , - ). 

^max 

The above special cases reveal that the behavior of the nonlinear system is 

very much dependent on y and v .In particular, when y (t) = y and 

max max m max 

v(t) = -v , the system has unbounded solutions when v > y and all solu- 
. max, . , , „ max max 

tions are bounded when y > v The results also carry over to the general 

max max J 

case when both v(t) and y (t) are time-varying and are stated in the following 
main theorem of the paper? (Fig. 1) 


Theorem l:Let |y (t) | < y , |v(t) | 
* 1 m - max 1 1 


* v and y (t) be a smooth persistently ex- 
— max m 


citing signal in the sense described in [23]. This implies that positive numbers 

T ,e and 6 exist such that given any t 1 > 0, there exists a t 0 e [t ,t + T ], 

U U U i 1 , t Z 11 u 

] and — I 2 0y (x)dx| > e . Then 

T o Jt„ m - 0 


with U 2 ,t 2 + 6 q ]C + T Q 


(a) If v 


max 


< v max * by choosing an input v(t) as 


v(t) =* -sgn(y (t))v when I e- ( t) I > y 

® max 1 1 1 — 'max 

■ sgn(e-(t))v |e-(t)| < y 

”1 max 1 1 1 J max 

where sgn(a(t)) * r /IT T when a(t) f 0 and is equal to unity when a(t) » 0, there 

1 3. ( 1 ) I 

exist initial conditions for which lim $(t) = “°° and e^(t) approaches asymptotic- 

ally the region |e^| < Y max + e » where e is an arbitrary positive constant. 

(b) If > v max + ^ where 6 is any arbitrary positive constant, then all the 
solutions of the differential equation (17) are bounded. 


Proof ; 

a) Let be the open domain enclosed by the line e^ » -v^ and the curve 
e + v 

<f> m — with < 0. When y (t) = y and v(t) * -v all solutions 

1 max 

that start on the boundary d(D^) enter D^. Since the system is autonomous and 

contains no singularities in D fall solutions originating D- are unbounded and 
lim d>(t) “ -«>, lim e 1 (t) * -y 

t~H» t-+°° 

For a time-varying signal y (t) the proof of unboundedness is related to the 
above autonomous case. Consider ro the solution of the differential equation with 
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initial condition (0 , 4 >q) with <f> < 


- v 

— — » with y (t) = y and v(t) = -v . 

w m max max 


Let r + denote the open curve along which the trajectory lies for all t 5 0. Simi- 
larly let r_ denote the curve along which the solution lies for all t * 0 when 
v(t) = v max ”and y(t) = -y^. Let r(<*> Q ) = r + U T_. T^) divides the plane into 

two open regions D 2 and D 2 ° where (0,<j>) e D 2 if <p < <p Q . Then all solutions of the 

differential equation with |y m (t) | $ and |v(t) | $ v with initial conditions 

on r(<{> 0 ) lie either in I'(^) or enter D^. Since this is true for every <j> 0 » the 

solutions are unbounded and lim ( t) = 

A T t ' H ” 2 

b) Let x = [e ,$]. Let D denote the region in H D#{x| |e | < v } and let 

^ IJ i 

D denote the complement of D. If W(x) = 1/2 x x, the time derivative of W along 

a trajectory is W(x) = + e^v < 0 for x e D C . Hence ||x || decreases in D C and 

can increase only in D. We wish to show that a constant c exists so that if 
lino'll " > c over an interval [t Q ,t 0 + T Q ], then||x(t 0 + T Q )|| < c^ 

If ||x(tg)|| » Cq, integrating the equation for e^ in (17) it can be shown that 

C TTJQV 

if x(t_) e D, then x(t,) e D for some t, e [t_,t rt + T.J if c_. > - ■ ■ ■■■ , where 
u t 1 u u u u sins 


cot 6 = 


2[T + 1] 

(e_-v )T 
0 max 


Hence under the conditions specified in the theorem, the 


trajectory invariably enters D during every period Tq. By increasing 

mono conically, the trajectory can be made to lie in a subdomain of D C for a finite 
time A with 0 < A < 6_ over every period. Since ||x(t)| decays exponentially in 
this subdomain, a constant c > c^ exists satisfying the conditions of the theorem. 

Comments ;j. The positive limit set of any solution x(t) lies in D. 

2. Cq will be referred to as the degree of persistent excitation. By the theorem, 
the solutions are bounded if e_ > v but the nature of the limit set depends on 

V ‘0 -<* V 0 “ 

3. From the theorem it follows that for a given bound v on the perturbations, 

the system can be made robust by increasing the degree max of persistent excita- 
tion. Note that this is an example of practical stability. 

4. The conditions for boundedness and unboundedness of solutions are given in 
this case in terms of y (t). For design purposes it is more appropriate to ex- 
press them in terms of m the reference input r(t). 


HYBRID ADAPTIVE CONTROL 


In continuous adaptive systems of the type described in the previous sections, 
the plant operates in continuous time and the controller parameters are adjusted 
continuously. Recent advances in microprocessor and related digital computer tech- 
nology favor the use of discrete systems in which signals are defined at discrete 
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instants. Practical systems on the other hand may contain both discrete and con- 
tinuous elements. Such systems may be described as hybrid systems. In a recent 
report [10] the authors have developed analytical models of hybrid ystems in 
which control parameters are adjusted in discrete time even as the continuous 
plant signals are processed in real time. The same algorithms can also be extend- 
ed to control disci ete time plants so that the overall discrete system operates 
on two time scales - a fast time scale in which the system operates and a slow 
time scale in which the control parameters are updated* We shall refer to such a 
system as a discrete hybrid system* 

In this section we describe briefly one of the hybrid adaptive algorithms and 
demonstrate global stability in the ideal case of an adaptive system which uses 
such an algorithm. The behavior of a discrete hybrid system is then discussed 
when bounded external disturbances are present* Using the results of the previous 
section , arguments are put forward as to why hybrid schemes should result in more 
robust systems and simulation results are presented to show that this is indeed 
the case* 


a) Hybrid Error Model : In this section we consider the first of several hybrid 
error models given in [10] and discuss its properties* Similar results can also 
be derived in all the other cases* The error model Is described by the equation 


<j> k u(t) = e^t) 


C e 


k e N 


(19) 


where u: JR + |Jt ra , e_: |R + are piecewise continuous functions which are refer- 

red to as the inputand output functions of the error model* {t, } is a monotonic- 

+ k 

ally increasing unbounded sequence in [)R with 0 < ^ ^ S ^max < °° ^ or k e ** 

where - t^. When T^ * T, a constant, we shall call T the sampling peri- 
od. 4> : is a piecewise constant function, referred to as the parameter 

error vector and assumes values <$>(t) « 4^, t c where $ is a constant 

vector* 


It is assumed that (and hence is unknown, the values u(t) and e^(t) 

can be observed at every instant t and ^ ^ can ac *justed at 

t * t. + .« The objective is to determine an adaptive law for choosing the sequence 
{ A<(>- } using all available input-output data so that lim e (t) ■ 0* 
k t-K* 1 

Theorem 2 : If in the error equation (19) the vector is updated according to the 
adaptive law t. t - 

k+1 e. (t)u(t) 

£ dx (20) 

- \ 1 + u (t)u(t) 

then 



(D 

(ii) 


if u(t) and u(t'> are uniformly bounded in |R lim *^(t) * 0 

if in addition to the conditions in (i) u i* persistently exciting over an 

interval T . , lim 6 . * 0 
min , r k 

k-H» 
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°o T 1 / 2 2 

(iii) If u e L then e^(t) = p(t)[l + u u] , p e L . 

IT 11 

Proof : If V(k) - y ^ using the adaptive law (20) we obtain AV(k) * “ y ^k 

f 21 - < 0 

i r k+1 /■ x t, , 

where R, = f . < x) dt . 


fc k 1 + u^(x)u(t) 


Hence V(k) is a Lyapunov function and assures the boundedness of $ . Since 


Z AV(k) < <*> it follows that lim AV(k) = 0. Hence 
k=l „ k-H*> , f t 


U„ * \» k - lim A- 

k-*» k-H» k 


k+1 


e x (r) 


dt = 0. 


t fc 1 + u (t)u(t) 

(i) If u is bounded, e^ is bounded and e^ e . If u is bounded lim e^(t) = 0. 

(ii) If u is persistently exciting is uniformly positive definite and hence 

<(>k 0 as k -> <». 

OO r - <— » »!■■— 2 

(iii) If u grows in an unbounded fashion with u e L , e- * p/.. . T where pel* 

" e 1 1 + u u 


Comments : In the three cases given in theorem 2 the first two assume that the 

input u is uniformly bounded and the corresponding results are applicable ) the 
identification problem. The third case which treats unbounded inputs is applic- 
able to the control problem. 


The fact that T^ need not be a constant is also worth noting. As shown in section 
6 a time-varying period may be used to improve the transient response of the sys- 
tem* 


b) Stable Hybrid Adaptive Control - Ideal Case : The hybrid adaptive algorithm 

described in the preceding section can be used to adjust the control parameters 
of a hybrid adaptive system. Using an approach very similar to that used in sec- 
tion 3 for a continuous time system the overall system can be shown to be globally 
stable o Using the same notation as in section 3 we have for the adaptive law 

, (^k+l e 1 (T)c(x) 

A *k T~ lt ~ T dT 

k ^k 1 + C (t)c(t) 

From the analysis in the previous section we conclude that 


(i) the parameter error vector is bounded 

' *j» 2 

and (ii) e^-p/l+cc where pel, 

whi^h conditions are the same as those ohtained for the continuous case. Condi- 
tion (i) assures that the signals y^, ui ^ , ||u(t)|| and|^(t)j| grow at the same 

rate. Condition (ii) results in |y (t) j ** o sup||co(t)|| which contradicts the pre- 

p r 

vious assertion proving the boundedness of all the signals. 

The similarity between the continuous and hybrid systems also extends to 
cases when external bounded disturbances are present and the methods described in 
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sections 3 and 4 apply to the hybrid case as well. However, as shown in the fol- 
lowing section, the use of averaged values over an interval rather than instant- 
aneous values, results in more robust control. 

c) Adaptive System with Two Time Scales : The hybrid adaptive algorithm developed 

in section 5a and applied to hybrid adaptive systems in section 5b can also be 
s>’ ‘ modified for discrete hybrid systems or discrete systems with two time 

. es, shown below. 


Let the output error e^(k) e IR^ and the parameter error vector $(k) e fR n be 
related by tne error equation 


<t> (k)w(£) = e^O 


k,£ e N, £ e[kT,(k+l)T] 


( 21 ) 


where <J>(k) is a constant vector over the interval [kT, (k+l)T] , T e N and denotes 
the period of the interval and w(£) e (R is an input vector. Using information 
collected over the entire interval, the parameter error vector $(k) is updated at 
time (k-fl)T using the adaptive law 


. (k-fl)T-l e. (i)w(i) 

♦ <k+l) - 4>(k) = A4> (k) = - 1/T E 

i-kT 1 + w(i) w(i) 


where R(k) = 1/T 


(k+l)T-l 
E 
kT 


- R(kH(k) 

w(i)w(i) T 
1 + w(i)^w(i) 


( 22 ) 


In [10] it is shown that V(k) * 1/2 (J> (k) (k) is a Lyapunov function for the 
system (21) from which it follows that $(k) is bounded if $(0) is bounded and 


e.(i) 

lim T lj2 

i-Ko [l+w(i) w(i)] 


i e N 


(23) 


If the adaptive law (22) io used in a control system to update the parameters, 
equation (23) can be used to demcnstrate global stability [10]. 


When an external disturbance v is present the error equation (23) have to be 
modified as 


4> T (k)w(£) + v(£) = e 1 (£) £ e[kT,(k+l)T] 


(24) 


Using the same adaptive law as before, the error equation has the form 

(k+l)T-l ... ... 

A$(k) = -R(k)<J>(k) + E "( . ^ (i) 

i-kT 1 + w(i) w(i) 


(25) 


where s(k) 


(k+l)T-l 

E 

i=kT 


- -R(kH(k) + s(k) 
w(i)v(i) 


1 + w(i) f w(i) 


The matrix R(k) and the vector s(k) in algorithm (25) are averaged values 
over an interval rather than instantaneous values. Hence the equivalent system 
may be considered to have more persistently exciting inputs in its homogeneous 
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equation and a smaller magnitude of perturbation (if the mean value of the dis- 
turbance is small). Due to both reasons the outputs tend to be smaller. Simula- 
tion results shown in Fig. 2 indicate the dramatic improvement in performance. 

6. NEW DIRECTIONS 

The criteria for judging the performance of an adaptive control system are 
no different from those used for any conventional control system and include sta- 
bility speed and accuracy of response. In the preceding sections methods using 
persistent excitation of reference input, and nonlinear and hybrid adaptive algo- 
rithms were described which would make the overall system stable under perturba- 
tions. A judicious combination of these different methods may improve the robust- 
ness of the system substantially and result in schemes which are practically at- 
tractive. Some of these combinations as well as extensions of known methods which 
appear promising are given below. 

(i) Robustness of n^* 1 Order System Using Persistent Excitation : A detailed analy- 

sis of a first order adaptive system containing a single control parameter was 
giverv in section 4. When a disturbance Is present it was shown that a sufficiently 
large persistently exciting reference input would also result in bounded solutions. 
Further studies have revealed that similar conclusions can be drawn regarding 
higher order systems and research is currently being done to determine the bounds 
on the solutions, 

(ii) Hyb rid Adaptive Control : In the adaptive control system described in sec- 

tion 5, it was shown that the sampling interval T could itself be time-varying 

provided it lay in a bounded interval [T . ,T ] with T . >0. In practical 

min ma x min 

systems It appears possible to adjust T, on line to improve the transient response 
of the - n. 

(iii) eavi-Zone, Persistent Excitation and Plant Identification ; A sufficiently 
large dead-zone in the adaptive algorithm was shown to result in bounded solutions 
in section 3 C The results in section 4 indicated that boundedness of solutions 
could also be achieved by increasing the PE of the reference input. It therefore 
appears likely that the same results can be achieved using a combination of a 
smaller dead -zone and a smaller degree of persist' excitation. Simulation stu- 
dies have shewn that this is indeed the case and aL<.empts are being made to demon- 
strate this theoretically. 

When the reference input is persistently exciting and the adaptive loop Is 
stable, the plant parameters can be estimated on-line and used in second level 
adaptation to reduce the dead-zone further. Hence combining a dead-zone with PE 
of reference inputs appears to be oi both theoretical and practical interest. 

(iv) II 0* II and Persistent Excitation: . , ... , 

M 11 max As in (iii) a persistently exciting in- 

put enables 0* to be estimated and hence an attempt could be made to use ths infor- 
mation to decrease the region of search. 

(v) cr-modif ication and Persistent Excitation : The o-modif ication scheme, in its 

basic form, described in section 3 is unappealing, since the parameter error can 
be large if || 6*|| is large. Using identification methods as in (iii) and (iv) and 
estimating 0* on line, second level adaptive procedures may result in a smaller 

bias* 
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The second level adaptation problems stated in (ii)-(v) while practically at- 
tractive, lead to stability questions in more complex nonlinear systems. Further, 
it is worth pointing out that all of them consider external disturbances rather 
than perturbations in plant dynamics. The reduced order problem which deals with 
the design of a low order controller to adaptively control a higher order plant is 
generally agreed to be the single most important theoretical question in the field 
of adaptive control. While considerable research is being carried out in this 
area, it is acknowledged that even a proper formulation of this problem is a form- 
idable one. It is felt that the answers to some of the questions raised in this 
section will contribute significantly towards this end. 
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EMPHASIS ON AIRCRAFT FLIGHT DATA 
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ABSTRACT 


Accurate Modeling of flexible space structures is an important field that is 
currently under investigation. Parameter estimation, using methods such as max- 
imum likelihood, is one of the ways that the model can be improved. The maximum 
likelihood estimator has been used to extract stability and control derivatives 
from flight data for many years. Most of the literature on aircraft estimation 
concentrates on new developments and applications, assuming familiarity with 
basic estimation concepts. This paper presents some of these basic concepts. 

The paper briefly discusses the maximum likelihood estimator and the aircraft 
equations of motion that the estimator uses. The basic concepts of minimization 
and estimation are examined for a simple computed aircraft example. The cost 
functions that are to be minimized during estimation are defined and discussed. 
Graphic representations of the cost functions are given to help illustrate the 
minimization process. Finally, the basic concepts are generalized, and estima- 
tion from flight data is discussed. Specific examples of estimation of struc- 
tural dynamics are included. Some of the major conclusions for the computed 
example are also developed for the analysis of flight data. 


INTRODUCTION 


Accurate modeling of flexible space structures is an important area that is 
currently under investigation. The mathematical nr -deling of these structures 
can be improved using parameter estimation. Such techniques have been success- 
fully used to estimate aircraft stability and control derivatives and refine 
aircraft mathematical models. Some of the experience gained in the aircraft 
problem can be applied directly to analysis of flexible space structures. 


The maximum likelihood estimator has been used to obtain stability and con- 
trol estimates from flight data for nearly 20 years. The results of many appli- 
cations have been reported worldwide. Reference 1 contains a representative 
list of some of these reports. Several good texts (including Refs. 2 and 3) 
contain thorough treatments of the theory of maximum likelihood estimation. 
Experience reports (Refs. 1, 4, and 5) pointing out practical considerations for 


*Senior Staff Scientist. 

^Aerospace Engineer. 


PRECEDING PAGE BLANK NOT FILMED 


197 



applying the maximum likelihood estimator have also been published* Stability 
and control derivatives estimated f roa flight data are currently required for 
correlation studies with predictive techniques, handling qualities documentation, 
design compliance, aircraft simulator enhancement and refinement, and control 
system design. Correlation, simulation, and control system design applications 
(including the space shuttle) are discussed in Ref. 6. Current studies have 
concentrated on estimation model structure determination (Refs. 7 and 8), equa- 
tion error with state reconstruction (Refs. 9 to 11), and maximum likelihood 
estimation in the frequency domain (Refs. 12 and 13). 

Most of the reports in the estimation area concentrate on new developments 
and applications, assuming familiarity with the basic concepts of maximum like- 
lihood estimation. In this paper some of these basic concepts are reviewed, 
concentrating on simple, idealized models. These simple models provide insights 
applicable to a wide variety of real problems. 

This paper also presents sooe of the basics of maximum likelihood estimation 
as applied to the aircraft problem. It briefly discusses the maximum likelihood 
estimator and the aircraft equations of notion that the sstinator uses. The 
basic aspects of minimization and estimation are then examined in detail for a 
simple computed aircraft example. Finally, the discussion is expanded to the 
general aircraft estimation problem including specific examples of estimation of 
structural dynamics. 


SYMBOLS 


A,B|C,D, F ,G 


a N 


a x 

a y 

a z 

b 

c* 

C n 

Cx 

C Y 

C Z 


system matrices 

normal acceleration positive upward, g 
longitudinal acceleration, g 
lateral acceleration, g 
normal acceleration positive upward, g 
reference span, ft 
coefficient of rolling moment 
coefficient of yawing moment 
coefficient of axial force 
coefficient of side force 
coefficient of normal force 


f(*), g(-) 


general functions 



GG* 

g 

H 

lx* *xz 

i 

a 

Kp 

L 


L* 


l YJ 

M 


N 


n 

a 

Pg 

p 

q 

q 

R 

r 

s 

T 

t 

u 

V 


measurement noise covariance matrix 

acceleration due to gravity, ft/sec 2 

approximation to the information matrix 

moment of inertia about subscripted axis, slug-ft 2 

general index 

cost function 

sidewash factor 

rolling moment divided by X x , deg/sec 2 
rolling moment, ft-lb 
rolling moment due to yaw jet, ft-lb 
pitching moment divided by Iy, deg/sec 2 
mass, slug 

number of time points or cases or yawing moment divided 
by I z , deg/sec 2 

state noise vector or number of unknowns 

estimated roll rate due to turbulence, deg/sec 

roll rate, deg/sec 

pitch rate, deg/sec 

dynamic pressure, lb/ft 2 

innovation covariance matrix 

yaw : ce, g/oec 

reference area, ft 2 

time increment, sec 

time, sec 

control input vector 
forward velocity, ft/sec 
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X 


x ay*yay» 2 ay 


H 

a 

e 

A 

A 

& 

«a 

«e 

fir 

n 

e 

u 

c 

a 

T 

♦ 

;■ 

0) 

Subscripts : 

p,q,r,a,a,0,fi, 

fi*fia*fir*fi« 


0 

m 


state vector 

distance between lateral accelerometer and the 
center ol gravity along the appropriate axis, ft 

observation vector 

predicted Kalman-filtered estimate 

angle of attack, deg 

angle of sideslip, deg 

estimated angle of sideslip due to turbulence, deg 

time sample interval, sec . 

control deflection, deg 

aileron deflection, deg 

elevon deflection, deg 

rudder deflection, deg 

measurement noise vector 

pitch angle, deg 

mean 

vector of unknowns 
standard deviation 
time, sec 

transition matrix or bank angle, deg 

integral of transition matrix, or heading angle, deg 

Irequency, rad/sec 

partial derivative with respect to subscripted quantity 

bias or at time zero 
measured quantity 
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Other nomenclature: 


predicted estimate 

estimate 

transpose 

indicates moment in ft- lb 


MAXIMUM LIKELIHOOD ESTIMATION 


The concept of maximum likelihood is discussed in this section. First the 
general heuristic problem is discussed, and then the specific equations for 
obtaining maximum likelihood estimates for the aircraft problem are given. In 
the following sections, both the concepts and the computations involved in a 
simple but realistic example are discussed in detail. 

The aircraft parameter estimation problem can be defined quite simply in 
general terms. The system investigated is assumed to be modeled by a set of 
dynamic equations containing unknown parameters. To determine the values of the 
unknown parameters, the system is excited by a suitable input, and the input and 
actual system response are measured. The values of the unknown parameters are 
then inferred based on the requirement that the model response to the given 
input match the actual system response. When formulated in this manner, the 
problem of identifying the unknown parameters can be easily solved by many 
methods; however, complicating factors arise when application to a real system 
is considered. 

The first complication results from the impossibility of obtaining perfect 
measurements of the response of any real system. The inevitable sensor errors 
are usually included as additive measurement noise in the dynamic model. Once 
this noise is introduced, the theoretical nature of the problem changes drasti- 
cally. It is no longer possible tc exactly identify the values of the unknown 
parameters; instead, the values must be estimated by some statistical criterion. 
The theory of estimation in the presence of measurement noise is relatively 
straightforward for a system with discrete time observations, requiring only 
basic probability. 

The second complication of real systems is the presence of state noise. 

State noise is random excitation of the system from unmeasured sources, the 
standard example for the aircraft stability and control problem being 
atmospheric turbulence. If state noise is present and measurement noise is 
neglected, the analysis results in the regression algorithm. 

When both state and measurement noise are considered, the problem is more 
complex than in the cases that have only state noise or only measurement noise. 
Reference 14 develops a mixed continuous/discrete maximum likelihood formulation 
that allows for both state and measurement noise. This formulation has a con- 
tinuous system model with discrete sampled observations. 
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The final problem for real systems is modeling* It has been assumed through- 
out the above discussion that for some value (called the "correct" value) of the 
unknown parameter vector, the system is correctly described by the dynaadc model* 
Physical systems are seldom described exactly by single dynamic models, so the 
question of modeling error arises. No comprehensive theory of modeling error is 
available. The most common approach is to ignore it: Any modeling error is 

simply treated as state noise or measurement noise, or both, in spite of the 
fact that the modeling error may be deterministic rather than random. The 
assumed noise statistics can then be adjusted to include the contribution of 
the modeling error. This procedure is not rigorously justifiable, but, combined 
with a carefully chosen model, it is probably the best approach available. 

With the above discussion in mind, it is possible to make a more precise, 
mathematically probabilistic statement of the parameter estimation problem. The 
first step is to define the general system model (aircraft equations of motion). 
This model can be written in the continuous/discrete form as 

x(to) * xo (1) 

i(t) - f[x(t),u(t),£] + F(£)n(t) (2) 

z(ti) - g[x(ti),u(ti),ei + GU)hi (3) 

where x is the state vector, z is the observation vector, f and g are system 
state and observation functions, u is the known control input vector, £ is the 
unknown parameter vector, n is the state noise vector, and f) is the measurement 
noise vector. The state noise vector is assumed to be zero-mean white Gaussian 
and stationary, and the measurement noise vector is assumed to be a sequence of 
independent Gaussian randou variables with zero mean and identity covariance. 

For each possible estimate of the unknown parameters, a probability that the 
aircraft response time histories attain values near the observed values can then 
be defined. The maximum likelihood estimates are defined as those that maximize 
this probability. Maximum likelihood estimation has many desirable statistical 
characteristics: for example, it yields asymptotically unbiased, consistent, and 
efficient estimates (Ref. 15). 

If there is no state noise and the matrix G is known, then the maximum 
likelihood estimator minimizes the cost function 

J(£) 2 [sE(ti) - z^(t^) ]*(GG*)”1 (z(ti) - z^tti)] (4) 

i*1 

where GG* is the measurement noise covariance matrix, and z^(ti) is the computed 
response estimate of z at t^ for a given value of the unknown parameter vector 

£. The cos t function is a function of the difference between the measured and 
computed time histories. 
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If Eqs. (2) and (3) are linearizsd (as is the case for the stability and 
control derivatives in the aircraft problem), 


x(to) 

- XQ 

(5) 

x(t) 

* Ax(t) + Bu( t) + Fn( t) 

(€) 

z(ti) 

■ Cx(ti) + Dutt^) + GTii 

(7) 


For the* no-state-noise case, 

x$(t 0 ) 
*e< ti+i> 

h (t i> 


the term of Bq. (4) can be approximated by 


x 0 («) 


(8) 

♦«5(ti) 

+ 'l'tu(ti) + u(t i+1 )]/2 

(9) 

Cx^tt*) 

+ Du(ti) 

(10) 


where 


♦ 

* 


exp [A(t i+1 - t ± ) ] 



exp (At) dT o 


When state noise is important, the nonlinear form of Eqs. (1) to (3) is 
intractable. For the linear model defined by Eqs. (5) to (7), the cost function 
that accounts for state noise is 


J(U * 1 E lz(ti) - Z£(t< ))*R- 1 lz(ti) - zr(ti)] + 1 N In |r| (11) 

i«1 2 

where R is the innovation covariance matrix# The Z£(t^) term in Bq# (11) 
is the Kalman-filtered estimate of z, which f if the state noise covariance 
is zero, reduces to the form of Bq. (4)# If there is no state noise, 
the second term of Eq. (11) is of no consequence (unless one wishes to include 
elements of the G matrix as unknowns), and R can be replaced by GG* which makes 
Eq. (11) the same as Eq. (4). 

To minimize the cost function J(£)# we can apply the Hewton-Raphson 
algorithm which chooses successive estimates of the vector of unknown coef- 

A A 

ficients, £. Let L be the iteration number. The L + 1 estimate of 5 is then 
obtained from the L estimate as follows: 

£l+i - £l - tv|j(£ L ) )-i [v|j(i L ) ] d2) 
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The first and second gradients are defined as 


N ^ 

V £J(S> = - 2 [z(ti) - zsttiHMGG*)- 1 [V^z € ( t A ) ] (13) 

i*1 

2 N 

^J(C) * 2 [V 5 z C (t i )]*(GG*)- 1 [V 5 2 ? (t i )l 
i=1 

N 

- 2 [z(ti) - z e (ti)]MGG*)-1[vf zgtti)] (14a) 

i=1 


The Gauss-Newton approximation to the second gradient is 

2 N 

V<£) 2 J IV^z^ttiJJ^tGG^J-’lV^z^tti)] 
i=1 


(14b) 


The Gauss-Newton approximation, which is sometimes referred to as modified 
Newton-Raphson, is computationally much easier than the Newton-Raphson approxi- 
mation because the second gradient of the innovation never needs to be calcu- 
lated* In addition, it can have the advantage of speeding the convergence of 
the algorithm, as is discussed in the SIMPLE AIRCRAFT EXAMPLE section. 

Figure 1 illustrates the maximum likelihood estimation concept. The meas- 
ured response of the aircraft is compared with the estimated response, and the 
•.ifference between these responses is called the response error. The cost func- 
tions of Eqs. (4) and (11) include this response error. The Gauss-Newton com- 
putational algorithm is used to find the coefficient values that maximize the 
cost function. Each iteration of this algorithm provides a new estimate of the 
unknown coefficients on the basis of the response error. These new estimates of 
the coefficients are then used to update the mathematical model of the aircraft, 
providing a new estimated response and, therefore, a new response error. The 
updating of the mathematical model continues iterati/ely until a convergence 
criterion is satisfied. The estimates resulting from this procedure are the 
maximum likelihood estimates. 

The maximum likelihood estimator also provides a measure of the reliability 
of each estimate based on the information obtained from each dynamic maneuver. 
This measure of the reliability, analogous to the standard deviation, is called 
the Cramer-Rao bound (Ref. 16) or the uncertainty level. The Cramer-Rao bound 
as computed by current programs should generally be used as a measure of rela- 
tive accuracy rather than absolute accuracy. The bound is obtained from the 
approximation of the information matrix, H. This matrix equals the approxima- 
tion to the second gradient given by Eg. (14b). The bound for each unknown is 
the square root of the corresponding diagonal element of H. That is, for 

the ith unknown, the Cramer-Rao bound is /H(i,i) . 
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The Maine-Iliff formulation (Ref, 14) and minimization algorithm discussed 
above are implemented with the Iliff-Maine code (MMLE3 maximum likelihood esti- 
mation program). The program and computational algorithms are described fully 
in Ref, 17, All the computations shown and described in the remainder of the 
paper use the algo-rithms exactly as described in Ref. 17. 


AIRCRAFT EQUATIONS OF MOTION 


For the discussion tnat follows in later sections of this paper, some ’ owl- 
edge of the airjraft equations of motion is assumed. To clarify some of tnc*i 
discussion, the aircraft equations are discussed briefly in this section. 

First, the axis system on which the aircraft equations of motion are based 
xs discuss id* Figure 2(a) shows thj aircraft reference xxly-axis system and the 
conventional control r^rfaces. The origin of the body-axis system is at the 
center of gravity. The sign convention for this axis system is defined by the 
right-hand rule with the x-axis defined as positive forward on the aircraft* 

The longitudinal acceleration (a x ) and nondimensional axial force coefficient 

(Cy) are defined along this axis, and the roll rate (p) and rolling moment (L 1 ) 

are defined about this axis* The y-axis is defined as positive out the right 

wing. The lateral acceleration (ay) and nondime'isional side force coefficient 

(Cy) are defined along this axis, and the pitch rate (q) and pitching moment 

(M' ) are defined about this axis. The z-axis is defined as positive out the 

bottom of the aircraft. The normal acceleration (a 2 ) and nondimensional normal 
force coefficient (Cg) are defined along this axis, and the yaw rate (r) and 

yawing moment (N* ) are defined about this axis* The normal acceleration is 
sometimes defined is positive upward but is then referred to as a^« The three 
moments (L*, M 1 , and N') are usually divided by the corresponding moments of 
inertia (I x , Iy, and I 2 }, and are then referred to without the prime as L, M, 

and N. These quantities are nondimensionalized (C 4 , C m , and C n , respectively) 

for use in tb- equations of motion soon to be discussed. The primary control 
about t;v- roll axis (x-axis) is the aileron ( 6 a ), about the pitch axis (y-axis) 

is the ele/ator ( 6 e ), and about the yaw axis (z-axis) is the rudder ( 6 r ). Some 
aircraft have other controls, but in this paper these will only be defined where 
they are discussed (the reaction control jets on the space shuttle, for example). 

The Euler angles ♦, 0, and + define the aircraft attitude with respect to 
the earth. These angles define the rotations which transform earth-fixed axes 
to the aircraft reference body-axis system of Fig* 2(a)* . The order of rotation 
must be about the z-axis (♦), then the y-axis ( 6 ), and finally the x-axis ($) 
for the aircraft equations of motion that will be written subsequently* 

For stability and control analysis, the velocity of the aircraft with 
respect to the air (not with respect to the earth) is of primary interest* 

Figure 2(b) shows the relationship between the aircraft axis system and the flow 
angles* The flow angle in the x-z plane is the angle of attack (a), and the 
flow angle in the x-y plane is the angle of sideslip (3). A more rigorous and 
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detailed definition is required for the derivation of the equations of motion, 
but the above definitions are sufficient to define the following equation of 
motion. 

Generalized nonlinear equations of motion are given in detail in Ref. 17, 
which fully describes the Iliff-Haine code (MMLE3 program). All computations 
an' 1 aircraft examples in this paper use the linearized form for the lateral- 
directional equations. These equations are given below and referred to in the 
remainder of the paper. 


where 


where the 6 
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The state, control, and observation vectors for the lateral-directional node 
can then be defined as 


X = (8 p r <fr)* 

(29) 

u = (6 a 6 r )* 

(30) 

• • 

z * < e m Pm J'm ♦* *y m Pm r m>* 

(31) 


SIMPLE AIRCRAFT EXAMPLE 


The basic concepts involved in a parameter estimation problem can be illus- 
trated by using a simple example representative of a realistic aircraft problem. 
The example chosen here is representative of an aircraft that exhibits pure 
rolling motion from an aileron input. Hiis example, although simplified, typi- 
fies the motion exhibited by many aircraft in particular flight regimes, such as 
the F-14 aircraft flying at high dynamic pressure, the F-111 aircraft at moder- 
ate speeds with the wing in the forward position, and the T-37 aircraft at low 
speed. 

Derivation of an equation describing this motion is straightforward. 

Figure 2(c) shows a sketch of an aircraft with the x-axis perpendicular to the 
plane cf the figure (positive forward on the aircraft). The rolling moment (L'), 
roll rate (p), and aileron deflection (6 a ) are positive as shown. For this 

example, the only state is p and the only control is 6 a » The result of summing 
moments is 

IxP-L'(p,fia) (32) 

The first-order Taylor expansion then becomes 

p - Lpp + L$ a « a (33) 

where 

L'« I X L 

Since the aileron is the only control, it is notationally simpler to use 6 
Instead of 6 a for the discussion of this example. Equation (33) can then be 
written as 

P - Lpp + l$6 (34) 

; alternate approach that results in the same equation is to combine Eq. (16) 
with £q. (20), substituting for C;, end then eliminate the terms that are zero 
for our example. This yields 

pl x = gab Cf p + Cj^e (35) 
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where p is the roll rate and 6 is the aileron deflection. Rearranging terms# 
the equation can be put into the dimensional derivative form of En. (34). 

Equation (34) is a simple aircraft equation where the forcing function is 
provided by the aileron and the damping by the damping-in-roll term, Ip. In 

subsequent sections we examine in detail the parameter estimation problem where 
Bq. (34) describes the system. For this single-degree -of -freedom problem, the 
maximum likelihood estimator is used to estimate either Lp or ox both for a 
given computed time history* 

We will assume that the system has measurement noise, but no state noise as 
in Eqs. (1), (2), and (3). Equation (4) then gives the cost function for maximum 
likelihood estimation. The weigh cing QG* is unisqportant for this problem, so 
let it equal 1. For our example, Eqs. (2) and (3) become x^ * p^ and z± * x^. 
Therefore, Bq. (4) becomes 


J(Ip,L<$) 



fpi - pi(V l 5 )j 2 


(36) 


where pi is the value of the measured response p at time t^ and Pj.(Ip,I>6) is the 

a a 

computed time history of p at time t^ for Ip * Ip and Throughout the 

rest of the paper, where computed data (not flight data) are used, the measured 

time history refers to p^, and the computed time history refers to Pi(Ip,L()« 

The computed time history is a function of the current estimates of lp and L$, 
but the measured tine history is not. 


The most straightforward method of obtaining pi is with Eqs. (3) and (8). 

In terms of the notation stated above, 

Pi+1 * ♦Pi ♦ ♦ (*! + *i+1>/ 2 (37) 


where 


♦ = exp (IpA) 


f A 1*6(1 - e*P (WO! 

♦ * I exp (LpT) dx • ; * 

J o r 

and A is the length of the sample interval (t^+i - t^). Simplifying the 
notation 


then 


*1+1/2 * r *i + *1+1 )/2 
Pi+1 = ♦Pi + ^ 1 + 1/2 


(38) 


(39) 
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The aaxinum likelihood estimate is obtained by ainimizing Eq. (36). The 
Gauss-Newton method described earlier is used for this miniaization. Equa- 
tion (12) is used to determine successive values of the estimates of the 
unknowns during the minimization. 

a A A A A 

Por this simple problem, £ * [Lp L$]* and successive estimates of Lp e.id 

are determined by updating Bq. (12). The first and second gradients of Bq. (12) 
are defined by Bqs. (13) and (14). The complete set of equations is given in 
Ref. 17. 

The entire procedure can now be written for obtaining the maximum likelihood 
estimates for this simple example. To start the algorithm, an initial estimate 

A 

of Lp and L$ is needed. This is the value of Co* With Bq. (12), Ci end sub- 

sequently Cl are defined by using the first and second gradients of J(Lp,L£) 

from Eq. (36). The gradients for this particular example from Eq. (13) and 
'14b) are 

„ N ~ 

V£J(£l> * -2 ( Pi - Pi>*£ Pi (40) 

i*1 

N 

V^J(i L ) H 2 (41) 

i*1 

With the specific equations defined in this section for this simple example, 
we can now proceed in the next section to the computational details of a speci- 
fic example. 


Computational Details of Minimization 

In the previous section we specified the equations for a simple example and 
described the procedure for obtaining estimates of the unknowns from a dynamic 
maneuver* In this section we give the computational details for obtaining the 
estimates. Some of the basic concepts of parameter estimation are best shown 
with computed data where the correct answers are known. Therefore, in this sec- 
tion we study two examples involving computed time histories. The first example 
is based on data that have no measurement noise, which results in estimates that 
are the same as the correct value. The second example contains significant 
measurement noise; consequently, the estimates are not the same as the correct 
values. Throughout the rest of the paper, wLere computed data are used, the 
term "no-noise case" is used for the case with no noise added and "noisy case" 
for the case where noise has been added. 

Since we are studying a simple computed example, it is desirable to keep it 
simple enough to complete some or all of the calculations on a home computer or, 
wit* some labor, on a calculator. With this in mind, the number of data points 
netds to be kept sn^ll. Por this computed example, 10 points (time samples) are 
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used. Tl.e simulated data, which we refer to as the Measured data, are based on 
Eq. (34). Me use the same correct values of Lp and Lj (-0.2500 and 10.0, 
respectively) for both examples. In addition, the same input (6) is used for 
both e> Moles . the saaple interval (A) is 0.2 sec, and the initial condi tiona 
are zeic- Tables of all the significant interaediate values are given with each 
example. These values are given to four significant digits, although to obtain 
exactly tl saae values with a computer or calculator requires the use of 13 
signified- digits, as in the computation of these tables. If the four-digit 
nua- er t are used in the computation, the answers will be a few tenths of a 
percent off, but will still serve to illustrate the minimization accuracy. In 

A 

both examples, the initial values of Lp and L$ (or are -0e5 and 15.0, 
respectively# 

Example With No Measurement Noise 

The leasarement time history for no measurement noise (no-noise case) is 
shown in Fig. 3# The aileron input starts at zero, goes to a fixed value, and 
then returns to zero. The resulting roxl-cate time history is also shown. Hie 
values of the measured roll rate to 13 sigrificant digits are given in Table 1 
along with the aileron input. 

A A 

Table 2 shows the values for Lp, L$, and J for each iteration, along with 
the values of A a..d 4> needed for calculations of p£. in three iterations the 
algorithm converges to the correct values to four significant digits for both Lp 

A 

and L$. L$ overshoots slightly on the first iteration and then cones quickly to 

at 

the correct answer. Lp overshoots slightly on the second iteration. 

Figure 4 shows the match between the measured data and the computed data for 
each of the first three terations. The match is very good after two iterations. 
The match is nearly exact after three iterations. 

Although the .a lgorithm has converged to four-digit accuracy in Lp and L$, 

the value of the cost function, J, continues to decrease rapidly between itera- 
tions 3 and 4. This is a consequence of using the maximum likelihood estimator 
on data with no measurement noise. Theoretically, using infinite accuracy the 
value of J at the minimum should le zero. However, with finite accuracy the 
value of J becomes small but never quite zero. This value is a function of the 
number of significant digits that are being used. For the 13-digit accuracy 

o o 

used here, the cost eventually decreases to approximately 0.3 * 10~*. 

Example With Measurement Noise 

The data used _n this example (noisy case) are the same as those used in the 
previous section, except that pseudo-Gaussian noise has been added to the roll 
rate. The time history i*. a .iorn in Fig. 5. The signal-to-noise ratio is quite 
low in thi~ example, as is readily apparent by comparing Figs. 3 and 5. The 

exact values of the time history to 13-digit accuracy are shown in ^able 3. The 

A A 

values of Lp, L£, $, 1>, and J are shown for each iteration in Table 4. The 
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algorithm converges in four iterations. The behavior of the coefficients as 
they approach convergence is much like the no-noise case. The most notable 

A A 

results of this case are the converged values of Lp and L§, which are somewhat 
different from the correct values. The match between the measured and computed 
time h: tc-y i* shown in Fig. 6 for each iteration. No change in the match is 
apparent for the last two iterations. The match is very good considering the 
amount of measurement noise. 

In Fig. 7, the computed time history lor the correct values of Lp and L$ is 
compared to that for the noisy-case estimates of ’ , and L$. Because the 
algorithm converged to values somewhat different w an the correct values, the 
two compute . time histories are similar but not identical. 

The accuracy of the converged elements can be assessed by looking at the 
Cramer-Rao inequality (Refs. 16 and 17) discussed earlier. The Cramer-Rao bound 
can be obtained from the following approximation to the information matrix. 

H * 2(J, iniaum )(v|j>-V(H-1) 


The Cramer-Rao bounds for Lp and L$ are the square roots of the diagonal ele- 
ments of the H matrix, or /H(1,1) and /h( 2,2), respectively. The Cramer-Rao 

A A 

bounds are 0*1593 and 1.116 for Ip and L$, respectively. The errors in Lp and 
Lfi are less than the bounds. 


Cost Functions 


In the previous section we obtained the maximum likelihood estimates for 
computed time histories by minimizing the values of the cost function. To fully 
understand what occurs in this minimization, we must study in more detail the 
form of the cost functions a.id some of their more important characteristics. In 
this section, the cost function for the no-noise case is discussed briefly. The 
cost function of the noisy case is then discussed in more detail. The same two 
time histories studied in the previous section are s-amined here. The noisy 
case is more interesting because it has a meaningful Cramer-Rao bound and is 
more representative of aircraft flight data. 

First we will look at the one-dimensional case where L$ is fixed at the 

correct value, because it is easier to grasp some of the characteristics of the 
cost function in one dimension. Than we will look at the two-dimensional case, 
where both Lp and Lj are varying. It is important to remember that everything 

shown in this paper on cost functions is based on computed time histories that 
are defined by Eq. (36). For every time history we might choose (computed or 
flight data), a complete cost function is defined. For the case of n variables, 
the cost function defines a hypersurface of n + 1 dimensions. It might occur to 
us that we could just construct this surface and look for the minimum, avoiding 
the need to bother with the minimization algorithm* This is not a reasonable 
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approach because, in general, the number of variables is greater than two* 
Therefore, the cost function can be described mathematically but not pictured 
graphically. 


One-Dimensional Case 

To illustrate the many interesting aspects of cost functions, it is eas- 
iest to first look at cost functions having one variable. In an earlier sec- 
tion, the cost function of Lp and Lg was minimized* That cost function is most 

interesting in the Lp direction. Ther^Jore, the one-variable cost function 
studied here is J(Lp). All subsequent discussions are for J(Lp) with Lg equal 

to the correct value of 10. Figure 8 shows the cost function plotted as a func- 
tion of Lp for the case where there is no measurement noise (no-noise case). As 

expected for this case, the minimum cost is zero and occurs at the correct value 
of Lp s -0.2500. It is apparent that the cost increases much more slowly for a 

more negative Lp than for a positive Lp. In fact, the slope of the curve tends 
to become less negative where Lp is more negative than -1.0. Physically this 
makes sense since the more negative values of Lp represent cases of high damping, 
and the positive Lp represents an unstable system. Therefore, the Pi for posi- 
tive Lp becomes increasingly different from the measured time history for small 
positive increments in Lp. For very large damping (very negative Lp) the system 

would show essentially no response. Therefore, large increases in damping 
result in relatively small changes in the value of J(Lp). 

In Fig. 9, the cost function based on the time history with measurement 
noise (noisy case) is plotted as a function of Lp. The correct value of 
Lp (-0.2500) and the value of Lp (-0.3218) at the minimum of the cost (3.335) 

are both indicated on the figure. The general shape of the cost function in 
Fig. 9 is similar to that shown in Fig. 8. Figure 10 shows the comparison 
between the cost functions based on the time histories with and without measure- 
ment noise. The comments relating to the cost function of the no-noist. ^ase 
also apply to the cost function based on the noisy case. Figure 10 shows 
clearly that the two cost functions are shifted by the difference the value 
of Lp at the minimum and increased by the difference in the minimum cost. One 

would expect only a small difference in the value of the cost when far from the 
minimum. This is because the "estimated" time history is so far from the 
measured time history that it becomes irrelevent as to whether the measured time 
history has noise added* Therefore, for large values of cost, the difference in 
the two cost functions should be small in comparison to the total cost. 

Figure 11 shows the gradient of J(Lp) plotted as a function of Lp for the 

noisy case. This is the function for which we were trying to find the zero (or 
equivalently, the minimum of the cost function) using the Gauss -Newton method of 
a previous section. The gradient is zero at L p * -0.3218, which corresponds to 
the value of the minimum of J(Lp). 

The difference between the Newton-Raphson method (Eq. (14a)) and the 
Gauss-Newton method (Eq. (14b)) of minimization has been mentioned previously. 
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For this simple one -dimensional case, we can easily compute the second gradient 
both with the second term of Eq. (14a) (Newton-Raphson), and without the second 
term (Gauss -Newton, Eq. (14b)). Figure 12 shows a comparison between the 
Newton-Raphson and the Gauss -Newton approximation second gradients. The 
Gauss-Newton second gradient (dashed line) always remains positive because it 
is the sum of quadratic terms (squared for the one-dimensional example). The 
Newton-Raphson second gradient can be positive or negative, depending upon the 
value of the second partial with respect to Lp. Other than the difference in 
sign for the more negative Lp, the two curves have similar shapes. 

As stated earlier, the Gauss-Newton method can be shown to be superior to 
Newton-Raphson in certain cases. We can demonstrate obvious cases of this 
with our example. An easy way to select a spot where problems with the Newton- 
Raphson method will occur is to look for places where the second gradient (slope 
of the gradient) is near zero o^ negative. Figure 11 has such a region near 
Lp = -1.0. If we choose a point where the gradient slope is exactly zero, we 

are forced to divide by zero in Eq. (12) with the Newton-Raphson meti ,jd. This 
point is at Lp = -1.13 in Fig. 12. If the value of the slope of the gradient is 

negative, then the Newton-Raphson method will go to very negative values of Lp. 
For very negative values of Lp, the cost becomes asymptotically constant and the 
gradient becomes nearly zero. In that region, the Newton-Raphson algorithm 
would diverge towards negative infinity. If the slope of the gradient is 
positive but small, we still have a problem with the Newton-Raphson me < hod. 

Figure 13 shows the first iteration starting from Lp « -0.95 for both Gauss- 

Newton and Newton-Raphson. The Newton-Raphson method selects a point where the 
tangent of the gradient at Lp * -0.95 intersects the zero line. This results in 

the selection of an Lp of approximately 2.6 in the first iteration. From that 
value it requires many iterations to return to the actual minimum. On the other 
hand, the Gauss-Newton method selects a value for Lp of approximately -0.09 and 

converges to the minimum to four-digit accuracy in two more iterations. With 
more complex examples a comparison of the convergence properties of the two 
algorithms becomes more difficult to visualize, but the problems are generaliza- 
tions of the situation we have observed with the one-dimensional example. 

The usefulness of the Cram&r-Rao bound was discussed in the Example With 
Measurement Noise section. At this point it is useful to digress briefly to 
discuss some of the ramifications of the Cramer-Rao bound for the one-dimensional 
case. The Cramer-Rao bound only has meaning for the noisy case. In the noisy 
example, the estimate of Lp is -0.3218 and the Cramer-Rao bound is 0.0579. The 

calculation of the Cramer-Rao bound was defined in the previous section for both 
one-dimensional and two-dimensional examples. The Cramer-Rao bound is an esti- 
mate of the standard deviation of the estimate. One would expect the scatter in 
the estimates of Lp to be of about the same magnitude as the estimate of the 

standard deviation. For the one-dimensional case discussed here, the range 
(Lp (-0.3218) plus or minus the Cramer-Rao bound (0.0579)) nearly includes the 

correct value of Lp (-0.2500). If noisy cases are generated for many time his- 
tories (adding different measurement noise to each time history), then the sam- 
ple mean and sample standard deviation of the estimates for these cases can be 
calculated. Table 5 gives the sample mean, sample standard deviation, and the 
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standard deviation of the sample mean (standard deviation divided by the square 
root of the number of cases) for 5, 10, and 20 cases* The sample mean, as 

expected, gets closer to the correct value of -0*2500 as the number of cases 
increases. This is also reflected by the decreasing values in column 4 of 
Table 5, which are estimates of the error in the sample mean* Column 3 of 
Table 5 shows the sample standard deviations, which indicate the approximate 
accuracy of the individual estimates* This standard deviation, which stays mora 
or less constant, is approximately equal to tne Cramer-Rao bound for the noisy 
case being studied here. In fact, the Cramer-Rao bounds for each of the 20 
noisy cases used here (not shown in the table) do not change much from the 
values found for the noisy case being studied. Both of these results are in 
good agreement with the theoretical characteristics (Ref. 16) of the Cramer-Rao 
bounds and maximum likelih od estimators in general. 

The examples shown here indicate the value of obtaining more sample time 
histories (maneuvers). More samples improve confidence in the estimate of the 
unknowns. The same result holds true in analyzing actual flight time histories 
(maneuvers); thus it is always advisable to obtain several maneuvers at a given 
flight condition to improve the best estimate of each derivative. 

The size of the Cramer-Rao bounds and of the error between the correct value 
and the estimated value of Lp is determined to a large extent by the length of 
the time history and the amount of noise added to the correct time history. For 
the example being studied here, it is apparent from Fig. 5 that the amount of 
noise being added to the time history is large. The effect of the power of the 

A 

measurement noise (GG*, Eqs. (3) and (4)) on the estimate of Lp (that is, Lp) 
for the time history is given in Table 6. The estimate of Lp is much improved 

by decreasing the measurement noise power. A reduction in the value of G to 
one-tenth of the valu^ in the noisy example being studied yields an acceptable 
estimate of Lp. For l light data, the measurement noise is reduced by improving 
the accuracy of the output of the measurement sensors. 

Two-Dimensional Case 

In this section the cost function (which is deiendent on both l p and i»6 ) is 
studied. The no-noise case is examined first, followed by the noisy case. 

No-noise case . Even though the cost function is a function of only two 
unknowns, it is much more difficult to visualize than the one-unknown case. The 
cost function over a reasonable range of Lp and L$ is shown in Fig. 14. The 

cost increases very rapidly in the region of positive Lp and large values of 
Lj. The reason is just an extension of the argument fcr positive Lp given in 
the previous section. The shape of the surface can be depicted in greater 
detail if we examine only the values of the cost function less than 200 for Lp 

less than 1.0. Figure 15 shows a view of this restricted surface from the upper 
end of the surface. The minimum must lie in the curving valley that gets 
broader as we go to the far side of the surface. Now that we have a picture of 
the surface, we can look at the isoclines of constant cost on the Lp-versus-Lj 

plane. These isoclines are shown in Fig. 16. The minimum of the cost function 
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is inside the closed isocline. The steepness of the cost function in the 
poeitive-Lp direction is once again apparent. Inside the closed isocline the 

shape is more nearly elliptical, indicating chat the cost is nearly quadratic 
here, so fairly rapid convergence in this region would be expected. The Lp axis 

becomes an asymptote in cost as L 5 approaches zero. The cost is constant for 
L5 - 0 because no response would result from any aileron input. The estimated 

response is zero for all values of Lp, resulting in constant cost. 

Figure 16 shows the region of the minimum value of the cost function, which, 
as seen in the earlier example (Table 2), occurs at the correct values for Lp 
and L$ of -0.2500 and 10, respectively. This is also evident by looking at the 

cost function surface shown in Fig. 17. The surface has its minimum at the 
correct value. As expected, the value of the cost function at the minimum is 
zero. 

Noisy case . As shown before in the one-dimensional case, the primary dif- 
ference between the cost functions for the no-noise and noisy cases was a shift 
in the cost function. In that instance, the noisy case was shifted so that the 
minimum was at a higher cost and a more negative value of Lp. In the two- 

dimensional case, the no-noise and noisy cost functions exhibit a similar shift. 
For two dimensions the shift is in both the Lp and L$ directions. The shift is 

small enough that the difference between the two cost functions is not visible 
at the scale shown in Fig. 14 or from the perspective of Fig. 15. Figure 18 
shows the isoclines of constant cost for the noisy case. The figure looks much 
like the isoclines for the no-noise case shown in Fig. 16. The difference 
between Figs. 16 and 18 is a shift in Lp of about 0.1. This is the difference 

in the value of Lp at the minimum for the no-noise and noisy cases. Heuristi- 

cally, one can see that the same would be true for cases with more than two 
unknowns. The primary difference between the two cost functions is near the 
minimum. 

The next logical part of the cost function to examine is near the minimum. 
Figure 19 shows the same view of the cost function for the noisy case as was 
shown in Fig. 17 for the no-noise case. The shape is roughly the same as that 

shown in Fig. 17, but the surface is shifted such that its minimum lies over 

Lp ■= -0.3540 and L$ * 10.24, and is shifted upward to a cost function value of 

approximately 3.3. 

To get a more precise idea of the cost of the noisy case near the minimum, 

we once again need to examine the isoclines. The isoclines (Fig. 20) in this 

region are much more like ellipses than they are ir Fig*>- 16 and 18. We can 
follow the path of the minimization example used before by including the results 
from Table 4 on Fi^. 20. The first iteration (L * 1 ) brought the values of Lp 
and 1*5 very close to the values at the minimum. The next iteration essentially 

selected the values at the minimum when viewed at this scale. One of the rea- 
sons the convergence is so rapid in this region is that the isoclines are nearly 
elliptical, demonstrating that the cost is very nearly quadratic in this region. 
If we had started the Gauss-Newton algorithm at a point where the isoclines 
are tfuch less elliptical (as in some of the border regions in Fig. 18), the 
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convergence would have been much slower initially, but much the same as it 
entered the nearly quadratic region of the cost function. 


Before concluding our examination of the two-dimensional case, we need to 
examine the Cramer-Rao bound. Figure 21 shows the uncertainty ellipsoid, which 

is based on the Cramer-Rao bounds defined in an earlier section. The relation- 
ships between the Cramer-Rao bound and the uncertainty ellipsoid are discuss ;d 
in Ref. 16. The uncertainty ellipsoid almost includes the correct value of Lp 

and Ii< 5 . The Cramer-Rao bound for Lp and L$ can be determined from the projec- 
tion of the uncertainty ellipsoid onto the Lp and L 5 axes, and compared with the 
values given earlier, which were 0.1593 and 1.116 for Lp and L$, respectively. 

ESTIMATION USING FLIGHT DATA 


In the previous several sections we examined the basic mechanics of obtain- 
ing maximum likelihood estimates from computed examples with one or two unknown 
parameters. Now that we have a grasp of these basics, we can explore the esti- 
mation of stability and control derivatives from actual flight data. For the 
computationally much more difficult situation usually encountered using actual- 
flight data, we will obtain the maximum likelihood estimates with th« Ilirf- 
Maine code (MMLE3 program) described in Ref. 17, The equations of motion that 
are of interest are given in the AIRCRAFT EQUATIONS OF MOTION section of this 
paper; the remainder of the equations are given in Ref. 17. 

In general, flight data estimation is fairly complex, and codes as the 

Iliff-Maine code must usually be used to assist in the analysis* However, one 
must still be cautious about accepting the results; that is# the estimates must 
fit the phenomenology, and the match between the measured and computed time his- 
tories must be acceptable. This is true in all flight regimes, but one must be 
particularly careful in potential problem situations such as ( 1 ) in separated 
flow at high Mach numbers or high angle of attack, (2) with unusual aircraft 
configurations such as the oblique wing (Ref. 18), or (3) with modern high- 
performance aircraft with high-gain feedback loops. In any of the above cases, 
one should be particularly careful where there are even small anomalies in the 
match. These anomalies may indicate ignored terms in the equations of motion, 
separated flow, nonlinearities, sensor problems, insufficient resolution 
(Ref. 1), sensor location (Ref. 1), time or phase lags (Refs. 1 and *9), or aay 
of a long list of ocher problems. 

The following brief examples are intended to show how the above caveats and 
the computed examples of previous sections can be used to assist in the analy- 
sis. Ii. the computed example, the desirability of low-noise sensors, an ade- 
quate model, and several maneuvers at a given flight condition is shown. 


Hand Calculation Example 

Sometimes evaluation of a fairly complex flight maneuver can be augmented 
with a simple hand calculation. One example of this can be found for the space 
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shuttle. The space shuttle is a large double-delta-winged vehicle designed to 
enter the atmosphere from space and land horizontally. The entry control system 
consists of 12 vertical reaction-control-systen (RCS) jets (six up-firing and 
six down-firing), 8 horizontal RCS jets (four left-firing and four right-firing) 
4 elevon surfaces, a body flap, and a split rudder surface. The locations of 
these devices are shown in Pig. 22. the vertical jets and the elevens are used 
for both pitch and roll control. The jets and elevons are used symae tries lly 
for pitch control and asymmetrically for roll control. The space shuttle con- 
trol system is described briefly in Ref. 6. 

The shuttle example used here is from a maneuver obtained at a Mach number 
of approximately 21 and an angle of attach of approximately 40°. The controls 
being used for this lateral-directional maneuver are the differential elevons 
and the side-firing jets (yaw jets). The maneuver is shown in Fig. 23. 

Equations (15) to (31) describe the equations of motion. A simplified approach 
can be used to determine some of the derivatives by hand. The approach is one 
that has been used since the beginning of dynamic analysis of flight maneuvers. 
In particular, for this maneuver the slope of the rates can be used to determine 
the yaw jet control derivatives. This is possible for this example, even with a 
high-gain feedback system, because the yaw jets are essentially step functions, 
and the slope of the rates p and r can be determined before the vehicle and the 
differential elevon (aileron) responses become significant. The rolling moment 
due to yaw jet (Lyj) is particularly important for the shuttle (Ref. 6 discusses 
the essential nature of flight-determined Lyj in the redefinition of entry 

maneuvers) and is, in general, more difficult to obtain than the more dominant 
yawing moment due to yaw jet. Therefore, as an illustrative example, Lyj is 

determined by hand. Figure 24 shows yaw jet activity and smoothed roll rate 
plotted at expanded scales. The equation for Lyj is given by 

Lyj ■ pl x / (Number of yaw jets) (42) 

$ - Ap/At - ♦ (0.1 ) (43) 


Therefore, given that I x 2 900,000 slug-ft 2 , and the number of yaw jets is 4, 

L yj 2 2750 ft-lb. 

The same maneuver was analyzed with MMLE3, and the resulting match 4 shown 
in Fig. 25# The match is very good except for a small mismatch in p at about 
6 sec. This small mismatch was studied separately with MMLE3 and found to be 
caused by a nonlinearity in the aileron derivative. The value from MMLE3 for 
Lyj is 2690 ft-lb, which for the accuracy used here is essentially the same 

value as obtained by the simplified method. The aileron derivatives would be 
difficult to determine as accurately as the yaw jet derivatives. Although good 
estimates can seldom be obtained with the slope method discussed here, rough 
estimates can usually be obtained to gain some insight into values obtained with 
MMLE3 (or any other maximum likelihood program). These rough estimates can then 
be used to help explain unexpected values of estimates from an estimation 
program. 
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Sometimes a flight example becomes too complex to allow anything other than 
qualitative estimates to be determined by hand* The example shown in Fig. 26 is 
the determination of the rudder derivative for tl e F-8 aircraft with the yaw 
augmentation system on. This example, taken from Ref. 20, includes an aileron 

pulse and a rudder pulse. Although an independent pilot rudder pulse is input 
during the maneuver, the rudder is largely responding to the lateral accelera- 
tion feedback. When the rudder is moving, several other variables are also 
moving, thus making it difficult to use the simplified approach just discussed. 
However, C n ^ can be roughly determined when the rudder moves, approximately 

1.7 sec from the start of the maneuver. Most of the slope of yaw rate 4 8 pro- 
bably caused by the rudder, but a poor estimate would be obtained using the hand 
calculation. 


Cost Function for Full Aircraft Problem 


The analysis of a lateral-directional maneuver obtained in flight typically 
has from 15 to 25 unknown parameters (as shown in Eqs. (15) and (31)), in 
contrast to the one or two in the simple aircraft example. This makes detailed 
examples unwieldy and any graphic presentation of the cost function impossible. 
Therefore, in this section we are primarily examining tl,e estimation procedure 
and the process of the minimization. 

For our flight example, we have chosen a lateral-directional maneuver, with 
both aileron and rudder inputs, that has 17 unknown parameters. The data ar<« 
from the oblique wing aircraft (Ref. 18) with the wing unskewed during the 
maneuver. This example was chosen because it is a typical maneuver. The time 
history of the data and the subsequent output of MMLE3 have been published in 
Ref. 21. Some results of the analysis are shown in Table 7. The match between 
the measured time history (solid *.ines) and the estimated (calculated) time 
history (dashed lines) is shown as a function of iteration in Fig. 27. Fig- 
ures 27(a) to (e) are for iterations 0 to 4, respectively. Table 7 shows that 
the cost remains unchanged after four iterations. A similar result was obtained 
for the two-dimensional simple aircraft example in Fig. 6 and Table 4. 

Of the many things the analyst must consider in obtaining estimates, the two 
most important or.sg are how good is the match and how good is the convergence. 

A satisfactory match and monotor convergence are necessary, but not suffi- 
cient, conditions for a success.' x analysie- Figure 27(e), although not per- 
fect, is a very good match. The convergence can best be evaluated by looking at 
the normalized cost in the last row of Table 7. The cost has converged rapidly 
and monotonically in four iterations, and it remains at the converged cost. 

These factors are convincing evidence that the convergence is complete. There- 
fore. the criteria of match and convergence are satisfied in our example. In 
some cases we might encounter cost that does not converge rapidly (in four to 
six iterations) or monotonically, or stay "exactly" at the minimum value. These 
situations usually indicate at least a small problem in the analysis. These 
problems, if found, are usually traced to an instrumentation or data aquisition 
problem, an inadequate mathematical model, or a maneuver that contains a mar- 
ginal amount of information. 
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Table 7 also shows that the startup values of all the coefficients are zero 
for the control and bias variables* Wind tunnel estimates could have been used 
for starting value ^ but the convergence of the algorithm is not very dependent 

or the startup values. As part of the startup algorithm, the MMLE3 program nor- 
mally holds the derivatives of the state variables constant until after the 
first iteration, is evident in Table 7, 

Figure 27(a) shows the match between the measured and computed data for the 
startup values. The match is very poor because the startup values for the con- 
trol derivatives are all zero, so the only motion is in response to the initial 
conditions. The control derivatives and blase , are determined on the first 
iteration, resulting in the much improved match shown in Fig. 27(b). The match 
after two iterations, shown in Fig. 27(c), is improved as the program further 
modifies the control derivatives and, for the first time, adjusts the deriva- 
tives affecting the natural frequency (C n g and C$g). By the third iteration 

(Fig. 27(d)), the improvement in the match is. almost complete, because minor 
adjustments to the frequency are made and the damping derivatives are changed. 
Fig. 27(e) shows the match when all but the most minor derivatives have ceased 
to change. 

Several general observations can be made based on this well behaved example. 
The strong or most important coefficients hare essentially converged in thr^e 
iterations. The same effect was seen in the simple example — that is, con- 
verged faster than Lp (Table 4). Some of the less important or second-order 
coefficients have only converged to two places after three iterations and are 
still changing by one digit in the fourth place at the end of six iterations. 
Another observation is that for some coefficients (0$^, C n< j , and Cj^ ) even 

though the sign is wrong after the first iteration, the algorithm quickly 
selects their correct values once the important derivatives have stabilized. 

In general, if the analysis of a maneuver has gone well, we do not need to 
spend much time inspecting a table analogous to Table 7. However, i/ there have 
been problems in convergence or in the quality of the fit, a detailed inspection 
of such a table may be necessary. The data may show an important coefficient 
going unstable at an early iteration, which could cause problems later. If ths 
starting values are grossly in error, the algorithm is driven a long way from 
reasonable values and then for many reasons does not behave well. Occasionally 
the algorithm alternately selects from two diverse set of values of two or more 
coefficients on successive iterations, behaving as if the shape of the cost 
function were a narrow multidimensional valley analogous to but more extreme 
than the two-dimensional valley shown in Figs. 18 and 20. 


Cramer-Rao Bounds 


The earlier sections regarding the computed example have shown that the 
Cramer-Rao bound is a good indicator of the accuracy of an estimated parameter. 
The Cramer-Rao bounds can be used in a similar, but somewhat more qualitative, 
fashion on flight data. The Cramer-Rao bounds that are included in MMLE3 (as 
well as many other maximum likelihood estimation programs) have been useful in 


219 



determining whether estimates are good or bad. The aircraft example discussed 
here has b ^n reported previously (for example, in Refs. 1 and 16). However, 

this example of the use of the Cramer-Rao bound in the assessment of flight- 
derived estimates is pertinent to the thrust of this paper. Figure 28 shows 
estimates of C n ^ as a function of angle of attack for the PA-30 twin-engine 

general aviation aircraft (Ref. 22) at three flap settings. There is a signifi- 
cant amount of scatter, which makes the reliability of the information on C n ^ 

questionable. The data shown are the estimates from the MMLE3 program, which 
also provides the Cramer-Rao bounds for each estimate. Past experience (Ref. 1) 
has shown that if the Cramer-Rao bound is multiplied by a scale factor (the 
result sometimes being called the uncertainty le^el (Refs. 1 and 16)) and plot- 
ted as a vertical bar with the associated estimate, it helps in the interpreta- 
tion of flight-determined results. Figure 29 shows the same data as Fig. 28, 
with the uncertainty levels now included as vertical bars. The estimates with 
small uncertainty levels (Cramer-Rao bounds) are the best estimates, as was 
discussed earlier in the section on Cramer-Rao bounds for the one-dimensional 
case. The fairing shown in Fig. 29 goes through the estimates with small 
Cramer-Rao bounds and ignores the estimates with large bounds. One can have 
great confidence in the fairing of the estimates, because the fairing is well 
defined and consistent when the Cramer-Rao bound information is included. In 
this particular instance, the estimates with small bounds were from maneuvers 
where the aileron forced the motion, and the large bounds were from maneuvers 
where the rudder forced the motion. Therefore, in addition to aiding in the 
fairing of the estimates, the Cramer-Rao bounds help show that the aileron- 
forced maneuvers are superior for estimating C n ^ for the PA-30 aircraft. 

This example illut> crates that the Cramer-Rao bounds are a useful tool in 
assessing flight-determined estimates, just as they were found useful for the 
simple aircraft example with computed datae 


Atmospheric Turbulence ( S tate Noise) 

Atmospheric turbulence (state noise) cannot always be avoided in flight; 
therefore, it is desirable to be aole to obtain stability and control deriva- 
tives in the presence of turbulence. In addition, an estimate of the turbulence 
time history can be of interest, particularly in the implementation of tur- 
bulence suppression systems. 

Many years ago it was demonstrated that the stability and control deriva- 
tives can be adequately determined with maximum likelihood estimation techniques 
for maneuvers performed in smooth air. If the^e techniques, which do not 
account for turbulence, are applied to data obtained in turbulence, net only are 
the resulting matches of the time histories unsatisfac tory but the estimated 
coefficients are unacceptable (Refs. 23 to 25). The technique described in 
Refs. 14, 23, and 25 can account for the effect of turbulence. With this tech- 
nique, maximum likelihood estimates of the stability and control derivatives as 
well as estimates of the turbulence time histories are >t>tained by minimizing 
the cost function given by Fq. (11). Results of the application of the tech- 
nique to longitudinal maneuvers obtained in turbulence have been reported 
previously (Refs. 23 to 25). 



Tne lateral-directional equations (Eqs. (15), (16), (17), (18), and (29)) 
can be modified in a Banner similar to that used to modify the longitudinal 
equations in Refs. 23 to 25. The turbulence (state noise) model is the Dryden 
expression, which is described in Ref. 26. The Iliff-Maine code (Ref. 17) can 
be used to obtain the maximum likelihood estimates where state noise is present. 

Thirty-eight seconds of data from the PA-30 aircraft flying in turbulence 
was analyzed at 50 samples/sec. The best match that could be obtained with the 
maximum likelihood estimation method that does not account fcr turbulence is 
shown in Fig. 30. The match is unacceptable and resulted in poor estimates of 
the stability and control derivatives. Figure 31 shows the match obtained with 
the maximum likelihood estimation technique that accounts for turbulence 
(Re' .14 and 17). The match is excellent and the maneuver provided acceptable 
es mated stability and control derivatives. It is also of interest to compare 
the power spectra of the estimated turbulence time histories. The power 

spectrum of the turbulence component affecting angle of sideslip, shown 

in Fig. 32. Figure 33 presents the power spectrum of the turbulence component 
affecting roll rate, Pg. The slopes of the asymptotes shown in Figs. 32 and 33 
are those defined by the Dtyden expression given in Ref. 26. Good agreement is 

a * 

showit between the power spectra and the asymptotes for 8g and pg. 

The algorithm used here is based on a linearized system described by 
Eqs. (5) to (7) and solved by minimizing the cost function given by Bq. (11). 

The system need not resemble that for the aircraft stability and control problem 
other than in the requirement for linearity. Therefore, many formulations for 
the structural problem are written in the form of Bqs. (5) to (7), and the 
algorithm under discussion can be directly applied with these formulations* 


ESTIMATION FOR SIMPLE STRUCTURAL PROBLEM 


The problem of the flexible space structure is most fully characterized as a 
distributed parameter system with its associated distributed system control 
laws. The model will vary depending upon changes in its configuration or its 
environment, such as solar heating. As in most cases, the preferred solution is 
the simplest successful approach. The lumped system approach is much simpler 
and computationally far more efficient chan the fully distributed parameter sys- 
tem approach. For example, structural mode control base^ on current state-of- 
the-art approaches has proved very successful* Admittedly, the aircraft struc- 
ture is heavier than most spacecraft, but many aircraft structures are highly 
complex, consisting of many subetructures within the main structure* To the 
novice, many of the sp^ce structures currently Seing investigated appear simpler 
than modern, large aircraft. If the lumped parameter system approach used for 
rhe aircraft pro* am is found to be inadequate, it seems likely that distributed 
parameter estimation codes will evolve to whatever complexity is necessary to 
sol Vi the flexible space structure problem. 
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This paper has discussed some of the erqterience gained f roe the application 
of aircraft stability and control analysis to flight data. The codes used for 
this analyses are for lumped paraeeter systems in the time domain. The codes 

have been used successfully for structural problems and are fully adaptable to 
the frequency domain if that is found to be preferable. 

Although few results have been obtained for time-domain structural analysis 
at the Ames Dryden Flight Research Facility, some superficial experience in 
structural time-domain analysis has been obtained. The following two examples 
show how the techniques being used for stability and control analysis can be 
applied to simple structural problems. The preceeding section discussed the 
incorporation of state noise in the model. The following examples do not 
include the use of state noise, but state noise, if warranted, could easily be 
incorporated in the types of examples to be discussed. 


Estimation of Structural Characteristics 


All aircraft have observable structural modes. These modes usually cause no 
difficulty in estimating stability and control derivatives because the struc- 
tural frequencies are higher than the aerodynamic frequencies. In general, if 
the structural frequencies are higher than the highest aerodynamic frequency by 
more than a factor of 5 to 10, they can be neglected unless their amplitude is 
so large as to mask measurement? desired for the aerodynamic analysis. However, 
if one or more structural modes are affecting the aerodynamic modes, as may 
occur in large aircraft, these structural nodes must be included in the mathe- 
matical model being analyzed. 

Even though no completely satisfactory practical results are available that 
account for structural modes and their interactions with the aerodynamics , it is 
interesting to assess the time-domain maximum likelihood analysis of the struc- 
tural modes independent of any interaction. This can be done where a structural 
mode is observed and no significant coupling is apparent. 

Figure 34 shows a structural mode on the lateral acceleration of an aircraft 
where little effect was observed for structural-aerodynamic coupling. The fre- 
quency of the mode is high enough that the mode does not interact with the aero- 
dynamic modes. Therefore, the stability and control derivatives were obtained 
separately and held constant for the succeeding analysis. The analysis con- 
sisted of using the maximum likelihood estimation program KMLE 3 (Ref. 17) with 
a sixth-order model that included the lateral-directional aerodynamic modes plus 
one structural mode. The dynamic pressure and the velocity were allowed to vary 
in the analysis. The structural mode frequency and damping were estimated as 
linear functions of dynamic pressure. The initial conditions were also esti- 
mated. A structural mode frequency of 7.84 Hz was chosen to start the estima- 
tor process. The comparison between the original data and the match obtained 
wiu.\ the maximum likelihood estimation method is shown in Fig. 35. The two time 
histories are in good agreement at the beginning of the maneuver and at the end 
of the maneuver, but they are 180° out of phase at a time of approximately 
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0*3 sec. The match shown in Fig* 35 suggests that the maximum likelihood esti- 
mator has reached a local minimum but not the global minimum. Multiple minima 
are not normally a problem when obtaining the stability and control derivatives 
of aircraft with the maximum likelihood estimation method. 

The reason for the multiple minima is demonstrated by the following simple 
scalar example. Let the noiseless measured response be z(t) * sin (u)Qt) the 
estimated response be Z£ * sin (u)t), where w is the only unknown coefficient. 
Then, by Eq. (4), the cost function becomes 

j(u>,t) = f [sin (u)Qt) - sin (wt)]^ dt 
J 0 

= * " 4^ sin (2 "0 t) " i sin <2u> T ) 


2u) w 0 

- — sin (wT) cos (wnT) - cos (wT) sir (unT) 

0)2 _ wq2 w 

If T is chosen to represent 10 cycles, as shown in Fig* 35, then for an u>q of 
1 rad/sec, T equals 20w. in Fig. 36, the cost function J(<i>,20t) is shown as a 
function of a). The global minimum is at an o> of 1 rad/sec, as it should be, but 
there are many local minima at increments of approximately 0*05 rad/sec. If a 
value of less than 0.97 or greater than 1*03 were chosen for a starting estimate 
of (i), the algorithm would converge to a local minimum* If a value of between 
0*98 and 1.02 were chosen, it would converge to the global minimum* Therefore, 
for this example where 10 cycles were observed, the starting value of u> must be 
less than 3 percent from the correct anwser to converge to the global minimum* 

Figure 37 shows a sine wave for the global minimum along with a sine wave 
with a frequency that varies 10 percent from the global minimum. The sine waves 
are in phase at the beginning and end, ana 180° out of phase in the middle* 

These data appear similar to those shown for flight data in Fig. 35. If only 
one or two cycles were used for the analysis, the problem illustrated in Fig. 37 
would be minimized* This is apparent in Fig. 38 where only the first cycle of 
Fig. 37 is shown. 

If T is chosen to represent only one cycle and wq remains equal to 1 rad/sec 

(as in Fig. 38), then T equals 2x* The cost function J(w,2*) is presented as a 

function of u> in Fig. 39* The global minimum is correctly at an w of 1 rad/sec, 

but now the algorithm converges to the global minimum if u> is started within 

approximately 25 percent of the correct value. 

Knowing the sensitivity of the algorithm when a record with many lightly 
damped cycles is being analyzed, the data of Fig. 34 can be reanalyzed starting 
closer to the observed frequency. Starting the maximum likelihood estimation 
method with an u) of 9.0 results in the fit shown in Fig. 40. This is an accep- 
table fit of the data. 
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Based on the preceding results, if data are to be analyzed where many cycles 
of a structural node are present, the structural mode frequency, u>, must be clo- 
sely approximated before starting the estimation process. 


Structural Modes in Space 

In the process ^f analyzing aircraft flight data, the authors have fre- 
quently observed results that clearly exhibit unmodeled dynamics. The unmodeled 
dynamics could be caused by many phenomena, such as higher-order aerodynamic 
modes or structural modes. These modes can usually be ignored and left unmo- 
deled because they have no effect on the results of primary interest in tne ana- 
lysis. If the unmodeled modes cannot be ignored, then the system equations must 
be revised to include the unmodeled modes. 

The authors have not yet found it necessary to model structural modes for 
data obtained in space in the process of obtaining control derivatives for the 
space shuttle. However, the structural modes have been observed. Figure 41 
shows the response of the space shuttle to the firing of a roll jet and a yaw 
jet at an altitude of 430,000 ft. The space shuttle configuration and the loca- 
tion of the RCS jets are shown in Fig. 22. The changes in the rigid-body rates 
and lateral acceleration caused by the jet firings are apparent in Fig. 41. The 
structural modes are also excited by the jets, as evidenced by the increased 
ringing in each signal at the time of the jet firings. The roll jet firing has 
little effect on the rigid-body response for the yaw rate and lateral accelera- 
tion; nowever, the yaw jet results in a rigid-body response for all the signals 
chosen. This maneuver was analyzed to obtain control derivatives for the rigid- 
body response described by Bqs. (15) to (31). The resulting match between the 
measured and computed response is shown in Fig. 42. The estimated control deri- 
vatives are in good agreement with those obtained from the maneuvers. The unmo- 
deled structural dynamic modes are evident, but it is apparent that the modes 
will have little effect on the rigid-body control derivatives. The differences 
between the measured and computed rigid-body responses (the residuals) for the 
time close to when the jets were fired are shown in Fig. 43. The data shown 
here are for a sample interval of 0.006 sec. Some persistent structural 
ringing is shown for the two rates and the lateral acceleration. However, when 
a jet is fired, the increased structural response is evident. The structural 
coefficients can be extracted directly from the residual as they were for the 
example in the previous section. It appears that there may be some contamina- 
tion caused by the rigid-body response at the instant the jets fire. If so, 
this contamination can be eliminated in one of two ways: either analyze the por- 
tion of the maneuver a tenth of a second after the jet fires, or adapt the equa- 
tions of motion to include the structural dynamics in addition to the rigid 
dynamics. The structural dynamics depicted in Fig. 43 have not been analyzed, 
but the procedure is straightforward. The procedure used on this case was the 
same as that used on the example in the proceeding section. It is apparent, 
however, that more than one structural mode would need to be included in the 
model. 

All the analysis techniques discussed in this paper apply to the analysis of 
this space shuttle example. If state noise is included in the mathematical 
model, then the linear form of Bqs. (5) to (7) would be required. In general. 
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if the structural partial differential equation can be expressed in tne linear 
foras of Eqs. (5) to (7) (with or without state noise), the structural nodes can 
be analyzed readily with the MMLE 3 program (Ref, 17) in the tine domain. If 

the analyst prefers, the problem can be expressed in the linear constant coef- 
ficient form and analyzed in the frequency domain, as described in Ref. 12. The 
relative advantages and disadvantages of time-domain analysis as compared with 
frequency-domain analysis are also discussed in that reference. If the equa- 
tions are nonlinear, but in the form of Eqs. (1) to (3), then maximum likelihood 
estimates can be obtained in the time domain. 


CONCLUDING REMARKS 


The computed simple aircraft example showed the basics of minimization and 
tha general concepts of cost functions themselves. In addition, the example 
demonstrated the advantage of low measurement noise, multiple estimates at a 
given condition, and the Cramer-Rao bounds, and the quality of the match between 
the measured and computed data. The flight data showed that many of these con- 
cepts still hold true even though the dimensionality of the cost function makes 
it impossible to plot or visualize. In addition, the techniques used for the 
aircraft problem were shown to be applicable to the flexible structure problem. 
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Table 1 Values of confuted time 
history with no measurement noise 


1 

6, deg 

P# deg/sec 

1 

0 

0 

2 

1 

0.9754115099857 

3 

1 

2.878663149266 

4 

1 

4.689092110779 

5 

1 

6.411225409939 

6 

1 

8.049369277012 

7 

1 

9.607619924937 

8 

0 

10.11446228200 

9 

0 

9.621174135646 

10 

0 

9.151943936071 


Table 2 Pertinent values as a function of iteration 


L 

Lp(D 

Lfi(L) 

♦ (L) 

♦ (L) 

J L 


0 

-0.5000 

15.00 

0.9048 

2.855 

21.21 


1 

-0.30C5 

9.888 

0.9417 

1.919 

0.5191 


2 

-0.2475 

9.996 

0.9517 

1.951 

5.083 x 

10” 4 

3 

-0.2500 

10.00 

0.9512 

1.951 

1.540 x 

10”® 

4 

-0.2500 

10.00 

0.9512 

1.951 

1.060 x 

10” 14 


Table 3 Values of computed time his- 
tory with added measurement noise 


1 

6, deg 

p, deg/sec 

1 

0 

0 

2 

1 

C. 4875521781881 

3 

1 

3.238763570696 

4 

1 

3.429117357944 

5 

1 

6.286297353361 

6 

1 

6.953798550097 

7 

1 

10.80572930119 

8 

0 

9.739367269447 

9 

0 

9.788844525490 

*0 

0 

7.382568353168 
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Table 4 Pertinent values as a function of iteration 


L 

Lp(L) 

bfiU) 

♦a> 

+(L) 


0 

-0.5000 

15.00 

0.9048 

2.855 

30.22 

1 

-0.3842 

10.16 

0.9260 

1.956 

3.497 

2 

-0.3518 

10.23 

0.9321 

1.976 

3.316 

3 

-0.3543 

10.25 

0.9316 

1.978 

3.316 

4 

-0.3542 

10.24 

0.9316 

1.978 

3.316 

5 

-0.3542 

10.24 

0.9316 

1.978 

3.316 


Table 5 

Mean and standard deviations for 

estimate* o* Lp 

Number 

of 

Sample mean. 

Sample standard 

Sample standard 
derivation of the 

cases , 

N 

U(Lp) 

deviation, o{Lp) 

mean, c(Lp)//N~ 

5 


-0.2668 

0.0739 

0.0336 

10 


-0.2511 

0.0620 

0.0196 

20 


-0.2452 

C.0578 

0.0129 


nble 6 Estimate of Lp and Cremer-Rao bound as 
a function of the square root of noise pouar 


Square root of 
noise power 

Estimate 
of Lp 

Cramer-Rao 

bound 

0.0 

-0.2500 


0.01 

-0.2507 

0.00054 

0.05 

-0.2535 

0.00271 

0.10 

-0.2570 

0.00543 

0.2 

-0.2641 

0.0109 

0.4 

-0.2783 

0.0220 

0.8 

-0.3071 

0.0457 

1.0 

-0.3218 

0.0579 

2.0 

-0.3975 

0. 1248 

5.0 

-0.6519 

0.3980 

10.0 

-1.195 

1.279 
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Roll rat# (p) 

Rolling momant (L # ) 



( c ) Simplified aircraft nomenclature * 
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Fig* 2 Concluded • 


Fig* 3 Time his- 
tory with no meas- 
urement noise • 
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Tima, aac Tima, aac 


Fig* 4 Comparison of measured and 
computed data for each of the first 
three iterations . 


AHaron 
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Fig* 5 rime his- 
tory vith meas- 
urement noise . 
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Tima, aac Tima, aac 


Fig. 6 Comparison of measured and 
computed data for each iteration • 
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Roil rata, 

P, 10 
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Tima, aac 

Fig* 7 comparison 
of estimated roll 
rate from no-noise 
and noisy cases • 





Fig. £ Cost function as a 

function if L p for no- noise case . 



Damping In roll, 

Fig. 9 Cost function as a function 
of Lp for noisy case • 
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Noiay 



Fig. 10 Comparison of the cost Fig. 11 Gradient of J(L p ) as a 

functions for the no-noise and noisy function of L p for noisy case, 

cases . 



Fig. 12 Comparison of Newton- 
Raphson and Gauss-Nf values of 
the second gradient the noisy 



Fig . 13 Comparison of first iter- 
ation step size for the N ewton- 
Rapheon and Gauss-Newton algorithms 
for the noisy c a»e. 


case. 
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Cmtmi mkm 
m4 mtknkmm 


Fig . I 7 M#tr of cost £um* 

tiom surface for fi>*j mime case* 
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IS tec Wm, 
*l|lw 1, 
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Co met «tit# 


oiM 


Fif. J9 Detailed view of ox 
tion surface fox misy case 


Fig. 18 isoclims of cotwtarA cost 
in L n aM La fox the misy case- 




Pig. 20 isoclines of constant cost 
for region near minimum for noisy 
case. 



Fig. 21 isoclines and uncertainty 
ellipsoid of the cost function for 
the noisy case. 
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Fig. 23 Lateral -directional apace 
shuttle maneuver at a Mach number of 21. 
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Fig. 24 Examples of obtaining Lyj by 
simple calculations for the shuttle 
data from Fig. 23. 
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(c; Two iterations. 


(d ) Three iterations, 


Fig . 27 Continued. 
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Pig . 28 Variations of C n ^ with angle 
of attack without uncertainty levels. 


(e) Four iterations. 
Fig. 27 Concluded. 
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rig. 29 Variations of C Dp with angle of attack 
with uncertainty levels. 
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Fig. 30 Match of flight data obtained 
in turbulence (state noise ) and com- 
puted data obtained from maximum 
likelihood estimator that does not 
account for turbulence. 


Fig. 31 Match of flight data obtained 
in turbulence (state noise ) and com- 
puted data obtained from a maximum 
likelihood estimator that accounts 
for turbulence. 
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A 

Fig. 32 Power spectral density of $g 
obtained from maneuver shown in Fig . 
31. 



Frequency, Hz 

Fig • 33 Power spectral density of pg 
obtained from maneuver shown in Fig. 
31. 


Flight 



Fig • 34 Structural mode oscillation 
observed on the lateral acceleration. 



Fig. 36 Cost functional for 
10 cycles of data as function 
of frequency , showing close 
proximity of local minima 
to global minimum. 
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Fig. 35 Hatch of measured and com- 
puted lateral acceleration obtained 
when maximum likelihood estimator 
converged to local minimum . 



Tlm«, mc 

Fig. 37 Simple scalar example illus- 
trating a local minimum similar to 
that shown for flight data in Fig. 35. 
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Fig. JS Simple scalar example showing 
only the first cycle . 



Fig. ?9 Cost function for one cycle 
of data as function of f zequency , 
showing wide region of convergence 
for global minimum • 
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Pig. 40 Acceptable match of Pleasured 
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Fig. 41 Dynamic response of space 
shuttle to firing of roll and yaw 
jets at an altitude of 430,000 Ft. 



TU.ia, aar 

Fig. 42 Maximum likelihood match 
of rigid-body response of the space 
shuttle . 
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Fig . 4J Differs 7 " , > en measured and computed rigid~body response 
( residual ) for «. shuttle. Altitude * 430,000 ft; dynamic 

pressure * 0* 
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SUMMARY 

The optimum sensor location problem, 0SL.P, may be thought of in terms of the 
set of systems, S, the class of input time functions, I, and the identification 
algorithm (estimator) used, E. Thus, for a given time history of input, the 
technique of determining the OSL requires, in general, the solution of the opti- 
mization and the identification problems simultaneously. * However, this paper 
introduces a technique which uncouples the two problems. This is done by means 
of the concept of an efficient estimator fpr which the covariance of the parameter 
estimates is inversely proportional to the Fisher Information Matrix. 

INTRODUCTION 

The problem of Ftructural identification in structural engineering is one 
which has received considerable attention from several resear r 'ers in the recent 
past (Refs. 1-4). Though various methods have been developed for identifying the 
different parameters that characterize a structure from records obtained in them 
under various loading conditions, few investigators, if any, have looked at the 
question of where to locate sensors in a structure to iequ^ *» data for "best 11 
parametric identification (Ref. 5). The problem of optimally locating sensors in 
a structural system arises from considerations of* (1) minimizing the cost of 
instrumentation; ana (2) efficiently detecting structural changes in the system 
with a view to acquiring improved assessment of structural integrity. 

The problem addressed in this paper can be stated as follows: Given m sen- 

sors, ..here should they he located in a structure so that records obtained from 
those locations yield the %est’* estimates of the unknown parameters? 

In the past, the optimal sensor location problem (OSLP) was solved by 
positioning the giv^n number of censors in the system, using the records obtained 
at those locations with a specific estimator, and repeating the procedure for 
different sensor locations. The set of locations which yield the "best" parameter 
estimates would then be selected as optimal. The estimates obtained, of course, 
would naturally depend upon the type of estimator ujed. Thus the optimal loca- 
tions are estimator dependent, and an exhaustive search needs to be performed for 
each specific estimator. Such a procedure, besides being highly computations v 
intensive, suffers from the major drawback of not yielding any physical inc* t 
into why certain locations are preferable to others. 

Recently, work r the solution of the OSIP was done by Shah and Udwadia 
(Ref. 5). In brief, .hey used a linear relationship between small per turbaf ions 
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in a finite dimensional representation of the system parameters and a finite 
sample of observations of the system tLne response. The error in the parameter 
estimates are minimized, yielding the optimal locations. In this paper , we develop 
a more direct approach to the problem which is both computationally superior, and 
throws considerable light on the rationale behind the optimal selection process. 

We uncouple the optimization problem from the identification problem using 
the concept of an efficient estimator (e.g. , the maximum likelihood estimator as 
tine becomes very large). For such an timator the covariance of the parameter 
estima es is a minimum. Using this technique and motivated by heuristic argu- 
ments, a rigorous form lation and solution of the OSLP is presented. The method 
is applied to a building structure modelled as a general linear dynamic system. 
For the N degree of freedom system considered, the methodology for selecting 
m(m<N) of the nodal displacements for purpose of measurement is presented. 

Sample calculations are made for a simple building &t T ‘ucture modelled as a 
tvo-degree-of-f reedom system subjected to base excitations. The u^ti^al sensor 
location for the identification of: (a) the mass ratio; and (b) the stiffness 

ratio is investigated. 

The results indicate chat the OSLP depends on: 

1) the class of systems, S, to which the structure belongs; 

2) the type of excitation; 

3) the actual system parar^ters involved; and 

4) the parameters to be identified. 

THEORY 

Consider a system modelled by the equation 

MX + CX + KX = F(t) (1) 

where M, C, and K are the (NXN) mass, damping and stiffness matrices, F(t) is 
an (NX1) vector containing inertial forces and extei/.ally applied loads and X 
is the N-vector of nodal displacements. Let 0^, 0 q and be vectors containing 
t~he various parameters related to the mass matrix, f he damping ma.rix and the 
stiffness matrix, respectively, whicn need to be identified. For convenience, 
we collect these quantities in the parameter vector, 0, defined as 



where the superscript T indicate matrix transpose. If the M, C and K are 
symmetric each of the three subvectors has a maximum dimension of N(N-fl)/?, 

Given m sensors (m < N) , we Jhen need to find where to locate them so that 
the covariance of the estimate, 0, is a minimum. Assume further that the 
measurement vector Z(t) can be expressed as 
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i = 1,2, ...,N 


( 2 ) 


^U) = g.[X(6,t)] + N ± ( t) 

where is the ith component of Z(t), and the functionals represent the 
"measurement process" - The dependence of the response X on the parameter 
vector 0 is explicitly noted The measurement noise N^(t) is taken as non- 
stationary Gaussian White noise with a variance of i|/2 ( t ) . Therefore, 

E [ N i (t l )N j (t 2 ) ] = * 2 ^ t 1 )6 K (i-j)« I) (t 1 -t 2 ) . (3) 

where 6^ and 6j) stand for the kroneker and the dirac-delta functions, 
respectively. A total of m out of N responses need to be selected so that 
they contain the most information about the system parameters and are maxi- 
mally sensitive to any changes in the parameter values. This "selection" 
process can be represented by an m-dimensional vector Y such that 

Y(t) = SZ(t) (4) 

where S is the (m * N) upper triangular selection matrix with each row con- 
taining null elements except for one which is unity. The m different compo- 
nents of Z selected to be measured are so ordered in vector Y, that if the 
element in the i-th row and k-th column of S is unity, the (i+l)-ith row has 
unity in its Jl-th column with l > k. The matrix S has the property that 
P = S T S in an (NXN) diagonal matrix with unity in its i-th row if, and only 
if, Z-£ is selected to be measured. The elements of P are otherwise zero. 

Hence, one can write 

Y(t) = Sg[X(9,t)] + SN(t) (5A) 

4 H f X ( 0 , t ) ] + V(t) (5B) 


If is linearly related to che response X^ , in general, then 


H [ X( 0 , t ) ] = SRX 


( 6 ) 


where R(t) can be thought of as a dynamic gain matrix. In the case that g^ 
is related to the resporse X^ only, then matrix R will reduce to a diagonal 

matriX ’ [ °j J* 

The problem of locating sensors in an optimal manner then reduces to 
determining the selection matrix S, or alternatively, finding the m locations 
m P that should be unity. These locations must be so chosen as to obtain the 
"best" parameter estimates. 

SOME MOTIVATING THOUGHTS AND THE FISHER INFORMATION MATRIX 


Consider a ^ase in which one tries to estimate only one parameter, ©x (to 
be identified) involved in a dynamic system model with only one sensor provided. 
Therefore, one wants to ideally choose a location i (out of N possible such 
locations) such that the measurement yi(t), ie[l,N], te(0,T) at location i yields 
the best estimate of the parameter ©i. Heuristicaliy , one should place the 
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sensor at such a location that the time history of measurements obtained at that 
location is most sensitive to any changes in the parameter He~ce, in equa- 

tion (5B) it is really the slope of H[X(0i,t)J with respect to that needs to 
be maximized. However, since only the absolute magnitude of this slope is of 
interest, it is logical to want l find i (or equivalently determine the selec- 
tion matrix S described previously) such as to maximize (3H/30 ;l) 2 over the inter- 
val (0,T) during which the response is to be measured. This leads to maximizing 
the following integral: 



When there is more than one parameter to be estimated, and the number of 
sensors is greater than unity, this intuitive approach needs to be extended in a 
more rigorous manner. In such cases recourse to math ema tical treatment is 
necessary, and we shall see that such treatment will be in agreement with our 
heuristic solution outlined above. 

To further understand the problem, let us loo 1 ; at it from another angle, 
namely, the concept of an efficient unbiased estimator. For such as estimator 
che covariance of the est im ates is a minimum. Furthermore, it can be shown that 
for any unbiased estimator of 0, 


|(e-0) (0-e) T | > 

/(H) 1 

(i§) /* 2 (t)at 

l J 

Lo 

_ 


( 8 ) 


where § is the estimate of 0 and the matrix [3H/30]^^ 4 3Hj/30j. If the esti- 
mator is "efficient", the above inequality becomes an equality. This means that 
the left-hand side of inequality (8) takes its lowest value (minimum covariance) . 

Hence, 


E ( 0—0) ( 0—0) T 



(9) 


The term inside the bracket on the right-hand side of the equation (9) is known 
as the Fisber Information Matrix, Q(T). Thus, maximizing Q(T) would indeed lead 
to a minimization of the covariance of the estimate, 0. 

Wa note then that the m sensor locations need to be so chosen that a suitable 
norm l the matrix Q(T) given by 


Q(T) 



0 


( 10 ) 
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is maximized. This constitutes an extension of equation (7), which we heuris- 
tically derived earlier for the scalar case, to the vector situation. Introducing 
equation (6) in equation (10) one may write 


Q(T) 


/ 


T T 

V PRX 9 

* 2 (t) 


dt. 


(ID 


where the ij element of can be written as: 

r i 

l x eji * = 1 e ll >^> j e (1 > m) 

j 

where X = {x^}^ and 0 = {6^}^. We note that the Fisher Matrix is symmetric and 
is dependent on che length of the record available, as well as the locations of 
the sensors as determined by the matrix P. 

If the m locations where the sensors are to be placed are denoted by 
s^, k = 1,2, then 


P = 


m 


£ 

k=l 



(12) 


where the (N * N) diagonal matrix' Is^ has all Its elements equal to zero except 
the element of the row, which is unity. Noting that P is a diagonal matrix, 
equation (11) can be simplified to yield 


Q[T;s 1> s 2 ,... > s n ;S t e;I] = 


m r 

E / 

k»l „ 


T T 

T r X A dt 

6 S k S k 6 


^ 2 (t) 


(13) 


where rs^ is the row of the matrix R. Also in eq. (13) explicit mention is 
made of the dependence of the Fisher Matrix on the time length T of the available 
data, the syscem S, the parameter vector 6, and the time-variant input I. If the 
matrix R is diagonal, with diagonal elements then the ij element of 

the matrix Q, after some manipulation, reduces to 


} ij[ T;s l ,s 2 ,S m ;S,9;I ] X# f 


3x 3 
s. s. 
k k 

ae i 



(14) 


Each element of 0^ represents the cross-9ensitivity of measurement with respect 
to the response x of node s, . 

S . K, 

k 


251 



The optimal sensor locations are then obtained by picking m locations s^, 
k - l,2,...,m, out of a possible N, so that a suitable norm of the matrix Q is 
maximized (e.g., the trace norm, etc...)* This may be specified by the 
condition 

max | | Q |T » s-t, » • • * » s j S , 9 5 il [ J • (15) 

s k e(l,N) L J 

Although there are several matrix norms which could be used, perhaps the 
most useful and physically meaningful in this context is the trace norm. In 
order not to detract the reader from the basic methodology we defer an exhaustive 
treatment of suitable matrix norms to a later communication. 

The methodology presented up to this point is valid for both linear and non- 
linear systems since the criterion developed in equation (13) was derived using 
only equations (3) and (9). We will now indicate its application to linear 
multi-degree-of-f reedom systems. 

APPLICATION TO LINEAR DYNAMIC SYSTEMS 

Consider the N-degree-of -freedom dynamic system whose governing differential 
equation of motion is given by eq. (1), together with X(tg) = Xg, X(to) = Xg, 
where Xq and Xg are the given initial conditions for the system. Assume the 
system to be classically damped. Introducing 


X(t) = *n(t) 

where ♦ is the (N x N) weighted modal matrix and q(t) is the N-vector of 
generalized coordinates we get 

o 

n + 25 N w N T>fAn = * T F(t), n(t Q ) * 4 > Tmx 0 , n (t Q ) * <j> MX Q , 
where the (N x N) diagonal matrix A is given by 


M ■ ,I|!t ■ NJ • “ d - [\ N ] • 

The solutic.i of equation (17) is given as 

t 

n.(t) = n 0 u i^ t " t 0^ + ^ l 0 V i^ t_t 0^ + / h i ( t-T )P i ( T ) dT 

i i ( 

0 

where and are initial conditions and 


(15) 


(17) 


(18 
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Cosua t + Sinw, t 

d i \ d i 


u,(t) - ExW-E.^t) 

v i(t) - E:<!(-i iV ))si«u d t, 




h^(t) - V ± (t), 

“d = u i \/ L " C i * and 
P.(c) = $ T F(t), i = 1,2,. ...N. 

Also, differentiating equation (1) with respect to 0, yields 
MX 0 +CX 0 +KX e = F Q (t) - (M e X+C 0 X+K 0 Xj; X Q (0) = 0, X 0 (O) = 0 


where 


K] y - 


with 


A. • 

M fi x = [M a x : M 0 x • M e x : M e X] 
e e i • °2 i l 


i = 1 , . . . , N , and j * 1 , . . . , L . 

Introducing 


X 0 = *z 


(19) 


yields 

z+2C n ^z+Az = G(t) 


( 20 ) 


where 


G(t) = <t T 


• //\. A A 

V( M e x+c e x+K e x 



Equation (21) can further be simplified to give 


G(t) 


T r //\ A A \-i 

♦ [ F 0"( M e trTfC e $n+K 0 $n )J 


( 21 ) 


( 22 ) 
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where rj and n can be obtained by differentiation of eq, (18). This may be shown 

as follows 


\(t) = r io i W i( t-t o) + ^O i Y i( t " t o) + J ' H 1 (t-T)p i (T)dT 


where 


vr(t> = -exp | 


Y i (t) = EXP^-CjUjt) 


(— £ . u. t\ 

Ui . + 

l i i / 

d i “d 4 


Sin t , 


Co so), t 
d i 


i\ Sin t 

\\) dl 


^(t) = Y jL (t), and 
p t (t) = $“F(t) , i = 1,2,. 


,N. 


Also 


t 

n i (t) = n o i Wi( t “ t o) + \ Y i( t “ t o) + / 

t o 


^(t-Op^TjdT 


where 



(23) 


(24) 
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Y t (t) 







Sin uk t 
d. 


- 2^ (d.Cosco, t 
i 1 

Mt) = Y.(t), 

X i 

p i (t) = $ T F(t) , i = 1,2, 



Therefore, substituting equations (23) and (24) into equation (22) gives G(t). 
Consequently the solution of equation (20) can be written as: 


z..(t) = f h (t-r)G. . (x)dx 
ij J i ij 

0 


(25) 


where h^(t) is the same as that of eq. (18). Notice that the initial conditions 
in eq. (20) are zero. This is due to the fact that the initial conditions of 
(18) are known constants. 

If we assume that [C] is expressed as a linear combination of [K] and [M] , 
then eq. (22) can further be simplified. Namely, 

C * 2aK+26M, (26) 


where a and 6 are known constants. Hence in equation (17), the percentage of 
damping, can be expressed as: 


5 i = 


aui + 
i wi 


i.u * ^ 1, 2, . . . ,N 


(27) 


To further simplify equation (22) under this assumption, let us consider the 
following three cases: 

1) The vector 0 contains only 9^, i.e., only estimation of mass param- 
eters is undertaken. Then 

T . 

G(t) = * [F 0 - M 0 <t>(n+2Bn)] * (28A) 

2) The vector 9 contains only the subvector 0^. Then 
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r /\ . 

G(t) = 4> [F 0 - K 0 <Kn+2an)] 


(28B) 


3) Finally if the vector 0 = [a g] , 


X • T • X 

G(t) - F - 2 An > F - 2Iiv 

a 6 


(29) 


If the input F(t) is not a function of 6, then Fq would be omitted all through 
this discussion. Once the solution of equation (25) is obtained, the Fisher 
Matrices may be obtained as in equation (13). Hence 


T T T 

T z $ r r $z 
m /• s s. 

q - r. f — . k * dt 


t f 

Vsl ~ 

0 


/(t) 


(30) 


Je note that the summation form of relation (30) is particularly amenable to the 
maximization of the trace norm of Q. 
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EXAMPLE 


To illustrate some of the ideas of the previous section, consider the problem 
of finding the optimal sensor location (OSL) in a structural system modelled by the 
two-degree-of-f reedom system (shown in Figure 1) which is subjected to the base 
excitation of f(t). 

The governing differential equation ot motion can oe expressed as 


M + CX + KX = -W f(t) 


(31) 


T T 

where X * <x^ x^> , C = aK, W = <Am and the matrices M and K are 



- 

- 


- 

• 


A 

0 


' B+l 

-i 

M = 

0 

1 

m , and K * 

-1 

i 


A case study for locating sensors to best identify (1) the mass ratio, A, of the 
first to the second floor and (2) the stiffness ratio, B, of the first to the 
second floor, will be presented. 

Let denote the lower mass location and S2 the upper mass location. The 
selection between the locations can be equated to determining the one non-zero 
element of the [1x2] selection matrix, S, with the measurement H(t) defined by 

H(t) = SX + V(t) , 


where, V r t) is Stationary Gaussian White Noise (S G W N) with ^(t)=^ Q . 


If S * [1 0] the lower mass is selected for measurement; if S * [G 1] the 
upper mass is selected. The location would then be preferred over the loca- 
tion S2 for identifying the parameter A, if Q[T,s^] > Q[T,S2l, where T is the 
time that the measurement is taken. 



(32 A) 
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Since only one parameter is being estimated the Fisher matrices reduce to 
scalars . 

The dependence of the OSL on various types of the base excitations can be 
studied now. Let us for this presentation consider ground acceleration in the 
form of a delta function, i.e., f(t) = 6(t) . 

In this case, closed form solutions foi and can be obtained. 

For the OSL problem for the "best" (minimum covariance) identification of 
the parameter A (given the parameters B and a) using an impulsive base excitation, 
Figure 2-A shows the plots of the ratio of the information matrices Ql(T)/Q 2 (T), 
for T = 50 secs, for various values of the parameters A (which is to be identi- 
fied) and a* 4 auJ o> where ojq 4 A:/m. Points on the graph with ordinates greater 
than unity indicate the optimal location to be the lower mass level and vice 
versa. The graphs indicate that the optimal location in most cases, for the 
range of A considered, is the upper mass level. However, we obse 've that for 
some small values of A and a* the OSL is the lower level. We note, interestingly 
enough, that the optimal sensor location for identification of A actually depends 
not only on the actual values of B and a which are presumably known, but also on 
the value of the parameter A itself which is to be identified! Thus to be able 
to ascertain the optimal sensor location some a priori assessment of A is 
necessary . 

Figure 2-B shows that the optimal location for identification of the param- 
eter B (given A and a), using an impulsive base input, is again the upper mass 
level for the range of B values considered. For larger B values, however, and 
ot*>0.05, the trend appears to be more and more in favor of the upper mass. This 
seems intuitively correct, for as B becomes larger, the lower part of the system 
becomes stiffer and the OSL would be the upper mass level. 

Figure 2-C is associated with the OSLP for estirating the parameter B usin ; 
a sinusoidal base excitation, f(t) 83 a sin wt. The figure shows that as the 
normalized driving frequency y = w/w Q varies, the OSL changes. For this example 
the Fisher Matrices can be computed in closed form. For the e stimat ion of B, 
(g' r ven A and a* * 0) the dimensionless driving frequency y = /14-1/A yields no 
information on B from records at either of the two mass levels. The responses at 
the two mass levels yield identical amour/ s of information on B at y = 0 and 
y = n for A I, as indicated by the values of Ql/Q2 - 1 at these frequencies. 
The value of QJ./Q2 - 0 at y = 1 is indicative of the fact that the upper mass 
level is . far better location for a sensor when estimating B with a* - 0. 

Figure 2-b shows the mean value of the r<_tio Q 1 /Q 2 for a random Gaussian white 
noise base excitation together with the 1-a band. The OSL appears to be at the 
upper mass level for identification of A. 
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CONCLUSIONS AND DISCUSSION 


This paper presents a general methodology for determining the optimal sensor 
locations in dynamic systems for obtaining records which would enable the "best" 
(minimum covariance) identification of a given set of unknown parameters in the 
system. The technique utilizes the concept of an efficient estimator to uncouple 
the identification from the optimization problem. In order to present the basic 
idea in as clear a fashion as possible, we have restricted the discussion in this 
sequel tc linear systems. 

The method has been illustrated by application to a two degree of freedom 
system. Though the results presented here for the simple system chosen form only 
a first step towards acquiring a detailed understanding of the OSL problem, the 
following conclusions appear to be relevant at this time: 

(1) The OSL for a given system heavily depends on the class of f orcing 
functions used for obtaining response data. In this study, an im- 
pulsive base motion is considered. 

(2) The OSL for linear dynamic systems is independent of the amplitude of 
the forcing function. 

(3) The OSL depends in general on all the values of system parameters. For 
instance, the OSL for estimating A with minimum covariance depends not 
only on the actual parameter values B and a but on the value of A it- 
self for the system! This implies that the OSL problem associated with 
identifying a given parameter (or a set of parameters) in a dynamic 
system necessitates the knowledge of some a priori estimates of the 
unknown parameter (s) . 

(4) ihe results of our simple example show that che OSL problem may yield 
solutions which may be dirficult to predict on pureJv heuristic 
grounds. The OSL appears to depend, even fur this relatively simple 
problem, in a rather complex manner on the actual parameter values of 
the system and the nature of the base excitation. 
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Figure 2 -A* Variation of Q 2 ./Q 2 for va^ous 
values of the parameter A, 
Ql/Q2 greater than unity 
indicates that optimal is 
at lower mass. 


Figure 1. 2-degree-of-f reedom 
generic structural 
system. 



Figure 2-3. Variation of Q^/Q 2 for various 
values of B. 
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ABSTRACT 


Parameter and state estimation techniques are discussed for an ellip- 
tic system arising in a developmental model for the antenna surface of the 
Maypole Hoop/Column antenna. A computational algorithm based on spline 
approximations for the state aud elastic parameters is given and numerical 
results obtained using this algorithm are summarized. 


I. IHTRODTCTIOH 


Results are presented from a Langley program directed towards 
developing computationally efficient identification techniques for flexible 
systems modeled by partial differential equations with an emphasis on large 
space structures. Initial efforts have been directed towards extending the 
spline-based theory and computational techniques used by the first two 
authors [l]-[6] in solving identification problems with delay and partial 
differential equation models in one spatial variable to solve distributed 
problems in several spatial variables. Additionally, ii. order to support 
Langley's technology development program [7] in large space antennae, a 
parameter and state estimation algorithm has been derived for a prototype 
distributed model of the Maypole (Hoop/Column) - antenna reflector 
surface [8]. The next section describes the Hoop/Coluan antenna and pre- 
sents the identificotion problem being considered. The state and parameter 
estimation approach is then outlined and discussed in tne context of the 
Hoop/Column application. Subsequent sections include mathematical details 
of the antenna application and numerical results. 
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II. THE MAYPOLE (HOOP/OOUMi) AMTEHMA 


For the purpose of technology development, the NASA Large Space 
Systems Technology (LSST) program office has pinpointed focus missions and 
identified future requirements for large space antennas for communications, 
earth sensing, and radio astronomy [7]. In this study, particular emphasis 
is placed on mesh deployable antennas in the 50-120 meter diameter cate- 
gory. One such antenna is the Maypole (Hoop/ Column) antenna shown for the 
100m point-design in Figures 1 and 2. This antenna concept is being devel- 
oped by the Harris Corporation, Melbourne, Florida, under contract to the 
Langley Research Canter [8] . 

The Hoop/ Column antenna consists of a knitted gold-plated molybdenum 
wire reflective mesh stretched over a collapsible hoop that supplies the 
rigidity necessary to maintain a circular outer shape. The annular 
membrane-like reflector surface surrounds a * telescoping mast which provides 
anchoring locations for the mesh center section (Fig. 1). The mast also 
provides anchoring for cables that connect the top end of the mast to the 
outer hoop and the bottom end of the mast to 48 equally spaced radial 
graphite cord truss systems woven through the mesh surface [8] • Tensions 
on the upper (quartz) cables and outer lower (graphite epoxy) cables are 
counter balanced to provide stiffness to the hoop structure. The inner 
lower cables produce, through the truss systems, distributed surface load- 
ing to control the shape of four circular reflective dishes (Figs. 1 and 2) 
on the surface. 

After deployment or after a long period of operation, the reflector 
surface may require adjustment. Optical sensors are to be locat'd on the 
upper mast which measure angles of retroreflective targets placed on the 
truss radial cord edges on the antenna surface. This information can then 
be processed using a ground-based computer to determine a data set of val- 
ues of mesh surface location at selected target points. If necessary, a 
new set of shaping (control) cord tensions can be fed back to the antenna 
for adjustment. 

It is desirable to have an identification procedure which allows one 
to estimate the antenna mesh shape at arbitrary surface points and the 
distributed loading from data set observations- It can also be anticipated 
that environmental stresses and the effects of aging will alter the mesh 
material properties. The Identification procedure must also allow one to 
address this issue* 

Considering the antenna to be fully deployed and in static equilib- 
rium, a distributed mathematical model which describes the antenna surface 
deviation from a curved equilibrium configuration is under investigation 
(for preliminary findings, see [9]). Using a cylindrical coordinate system 
with the z-axis along the mast, it is expected that the resulting model 
will entail a system of coupled second-order linear partial differential 
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equations in two spatial variables. The coefficients of these equations 
are functions of the material properties of the stretched mesh. The deri- 
vation and computer software for this model are still under development. 
In the meantime, a simpler developmental (prototype) problem has been 
solved which is descriptive of the original problem. 

For the developmental problem, the loading is assumed to be normal to 
the horizontal plane containing the hoop rim, and the mesh surface is 
assumed to be described by the static two-dimensional stretched membrane 
equation flO] with variable stiffness (elastic) coefficients and appropri- 
ate boundary conditions for the Hoop/Coluon geometry. Mathematically, in 
polar coordinates, we have 


• 7 It [ rE(r ’ e) It] - 7 le [ E(r > 0) It] • f(r * e) (1) 


where u(r,0) is the vertical displacement of the mesh from the hoop 
plane, f(r,0) is the distributed loading force per unit area, and 
E(r,0) >0 is the distributed stiffness (elastic) coefficient of the mesh 
surface (force/unit length). Equation (1) is to be solved over the annular 
region Q = [e,R] x [0,2*]. Appropriate boundary conditions are 


u(e,0) - u Q 
u(R,0) * 0 

along with the periodicity requirement 

u(r ,0) - u(r ,2*) , 


( 2 ) 


(3) 


where R is the radius from the mast center to the circular outer hoop, 
e is the radius from the mast to the beginning of the mesh surface (see 
Fig. 2), and uq is the coordinate at r » e of the meah surface below 
the outer hoop plane. 

We further assume that the distributed loading along with a data set 

of vertical displacements, u (r ,0 ), at selected points (r ,0 ) on the 

m 1 j J 

mesh surface is known. Given this information, the developmental problem 
is to estimate the material properties of the mesh as represented by 
E(r,0) and produce state estimates of the surface represented by u(r,0) 
at arbitrary (r,0) points within ft. The procedure applied to solve this 
problem is discussed in the next section. 
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III. THE SYSTEM IDENTIFICATION APPROACH 


The first two authors and their colleagues have derived technique? for 
approximating the solutions to systems identification and control probaems 
involving delay equation models and partial differential equation models in 
one spatial variable and have used them in a variety of applications 
{ll] t [12] • The Hoop/Column application requires an extension of the theory 
and numerical algorithms to elliptic distributed systems in several spatial 
variables. The approach, when specialized to the system identification 
problem, may be s t nmarized as follows: (1) select a distributed parameter 
formulation containing unknown parameters for a specific system; (2) mathe- 
matically "project** the formulation down onto a finite dimensional subspace 
through some approximation procedure such as finite differences, finite 
elements, etc.; (3) solve the Identification problem within the finite 
dimensional subspace obtaining an estimate dependent upon the order of the 
approximation embodied in the subspace; (4) successively increase the order 
of the approximation and, in each case, solve the identification problem so 
as to construct a sequence of parameter and state estimates ordered with 
increasing refinement of the approximation scheme; (5) seek a mathematical 
theory which provides conditions under which tue sequence of approximate 
solutions approaches the distributed solution as the subspace dimension 
increases with a convergent underlying sequence of parameter estimates. 

In applying this approach to the developmental problem, the stiffness 
function is parametrized In terms of cubic splines of fixed order; thir 
converts the estimation of E(r,G) into a finite dimensional parameter 
estimation problem. After writing the energy functional generic to the 
membrane equation, the Galerkin procedure is used to project the distri- 
buted formulation onto a finite dimensional state subspace spanned by ten- 
sor products of linear spline functions defined over ft. The approximate 
displacement (state estimate' thus obtained is expressible in terms of the 
spline basis functions. The Galerkin procedure in this case yields alge- 
braic equations which define the displacement approximation coordinates in 
terms of the unknown E(r,8) parameters. In order to solve the approxi- 
mating parameter estimation problem, the parameters defining E(r,0) are 
chosen so that a .east squares measure of the fit error between the 
observed and predicted (by the estimated state) data aet is minimized. 
Finally, following steps (4) aiJ (5) an algorithm is constructed to 
determine the order of the linear spline approximation above which little 
or no further improvement is obtained in the unknown quantities as one 
increases the dimension of the subspaces. Details of this system 
identification approach are presented in the following sections. 
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IV. FINITE DIMENSIONAL APPROXIMATIONS 


Prior to applying the Galerkin procedure [13,14] to perform the finite 
dimensional approximation for the developmental problem, the boundary 
conditions (2) are converted to homogeneous form by introducing the new 
dependent variable 


y(r,e> . u(t,e) - (f-E-§) » 0 


Equation (1) then becomes 

-7 If ('«'•«&)- 7 Is K- e> &) 




e - R 


(4) 


(5) 


with boundary conditions 

y(e,6) - 0 

y(R,6) - 0 (6) 

y(r,0) - y(r,2w) . 


Jollowing the standard formulation (see (13,14]) for the weak or 

A 

variational for- of (5), the energy functional E associated with (5) is 


E(z) - c r & E(r,©)Vz • Vz - (r,0)zj 


rdrd0 


(7) 


where V is the gradient in polar coordinates which, in the form used 
here, li. equivalent to 



1 3_\ T 
r 30/ 


The function f is giver by 


( 8 ) 


?(r,0) 


1 a /rE(r,0)u \ 


(9) 


and the vertical displacement z(r,0) of the mesh surface awa_, from the 
hoop equilibrium plane is a function satisfying the boundary conditions (6) 
and possessing first derivatives on SI in the distributional sense (we 


1 A A 

denote this by ze (ft) = Z). The first variation 6E of E about 

0 , per 

the function y(r,0) is given by 



6E(y;v) 



r,0)Vy 


Vv - ?(r,8)v} rdrd© 


n R 

{E(r,0)Vy • Vr 


ff(r,0)v + E(r ,0)k • Vv] } rdrd0 


( 10 ) 


where 



( 11 ) 


and v is an arbitrary function in Z * H* (0). 

1 O.per 

A 

Given a finite dimensional subspace Z of Z, the Galerkin procedure 

* A 

defines the approximation y as the solution in Z of 



,0)Vy • Vv} rdrdG 


r 2 * 



0)v + E(r,0)k 


Vv} rdrd0 

( 12 ) 


for all veZ. 

For computational efficiency, the basis functions used for the 

a 

representations of y in (12) are taken as tensor products of linear 

A A 

B-splines ([13], p. 27; [14], p. 100). Thus v and y are in the space 
spanned by 

v^ N (r,0) - o”(r)0j(8) , (i - 1,...,M - 1; j - I,...,N) , (13) 

where u^ - a^(r), (i - 1,...,M- 1), and 0^ - 0j(8), ^ ‘ ..,N- 1), 

are standard linear B-spllnes with knots uniformly spaced over fe,R] and 
[0,2*], respectively, modified to satisfy the appropriate boundary 

r Mi 

conditions. The elements [a f are modified to satisfy homogeneous 

N 1 

boun’iry conditions while 0 has been altered to satisfy periodic 

N 

boundary conditions [15]. 
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For y M,N (r,0) 


within the subspace spanned by 



tie can write 


M-l N 


y M,N (r,G) - I l a«(r)w«! N 6j(0) . 
i-1 j-1 J 3 


(14) 


a MM 

Replacing y(r,0) in (12) by y ' (r,0) from (14) and successively set- 
ting v(r,0) ■ v^’ N (r,P) for i * 1,...,M - 1 and j - 1,...,U leads to 

M N 

a set of high-order linear algebraic equations for the w^J coordinates. 


We avoid sparse matrix methods in solving the w 
imposing a separability condition: 


M,N 


U 


equation by 


E(r ,0) - E 1 (r)E 2 (0) . 


(15) 


M M 

As shown in [15], condition (15) reduces the w”’ calculation to the 
so-utio. of the matrix equation ^ 


B M w M ’ N a n + d m w m ’ n c n - e m ’ n 



(16) 

(17) 

( 18 ) 

(19) 

( 20 ) 

(21) 
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and 


-M,N 



f(r,©)a^(r) 0j(9)rdrd9 



c£(r) 


|J rdrdo'j 


( 22 ) 


where, in (17)— (22) , i,p ■ - 1 and j,q » 

Equation (16) is rewritten in the equivalent fora 


[(dVb m ]w m - b ♦ u M ’ N [c H (^)- 1 ] - (d m )- 1 e h 'V 




(23) 


and solved by the Bartels-Stewart algorithm [16]. 

J.n order to estimate, via a numerical scheme, the functional 
coefficients Ej and E 2 , we parametrize these functions so that 

identification is performed over a finite-dimensional parameter set. To 
this end, let 


1 

E. ( r ) * l v l(r) (24) 

k-1 


1 

E (0) - l 6 u (0) (25) 

j-1 * J 


where and 5j are scalar parameters and and y* are 

cubic B-spline functions defined [13, p. 61] over [e,R] and 10, 2*1, 
respectively, whose orders are independent of H and N. The basic spline 
functions are modified so that Mj and its derivatives satisfy periodic 
boundary conditions. 

We turn next to the computer implementation of the identification 
scheme. 
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V. COMPUTATIONAL PROCEDURE 


Appealing to the ideas found in previous sections, we now detail an 
algorithm for estimating the coefficients v^, k - 1,..., and 6j, 
j * 1,..., N| , for E(r ,0) that provide the "best fit" between 

estimations of the state u and observed data Uq obtained from various 
sample points on the surface. We may equivalently consider data for y by 
making the transformation 


Vvy ■ wy 



(26) 


for i * 1 ) • • « j L|> find j * 1 9 y Lq • 

A parameter estimation algorithm may je organized into the following 
steps* 


1 • Select an order of approximation for the cubic spline elements 
k * Hi and j ■ 1 Nj , used to 

represent Ej and E 2 * Set n » 1. 


2* Select M and N, number of the linear spline basis elements 
used to represent (and 


3. Assume a nominal set of values for 


v » 



i 


M 


1 


) 


and 


6 - 



(27) 


(28) 


4. Calculate the -oef ficlent matrices in (23) and solve for 
w^»k'(v,6). 

5. Calculate, from (14), y**»N(r.« k ,0j; v,6) and evaluate 

L r L 6 r 

j m, V,<5) - l v,6) - y^vVj * (29) 

6. Proceed to step 8 if jM,N( v> {) is sufficiently small. 

Otherwise, through an optimization procedure, determine a net pair 

* * m N 

(v,6) which decreases the value of J ' . If no such pair can 

be found, go to step 8. 
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7. Set (v,6) ■ (v,6) and return to step 4. 

8. Preserve the current values of and the corresponding 

(v,6) pair as the a*" entry in a sequence of these pairs, 

ordered with increasing H and N. 

9. Proceed to step 10 if sufficient data has been obtained to analyze 
the sequences. Otherwise, set n » n + 1 and return to step 2 
with increased M and N. The current values of (v,6) will be 
used as initial values for the next optimization process. 

10. From analysis of the numerical sequences, select the (M,N) entry 
which indicates the best numerical results. The corresponding 
parameter estimate (v,6) pair yields E(r,6) which determines the 
material properties of the antenna mesh. The matrix 
when used in conjunction with (14), determines a state 
approximation yM,N f or the shape of the antenna surface. 

A convergence theory for the identification algorithm may be found 
in [15]. Numerical results are described in the next section. 


VI. R0W1ICAL RESULTS 


Experimental data for the Hoop/Column antenna is not available at this 
time. Therefore, synthetic data is constructed to demonstrate the 
preceding algorithm. 


As shown in Figure 2, the parent reflector has four separate areas of 
illumination on its surface. ^ Each area is assumed to have the same 
parabolic shape. For 0 0 £ ^ an< * e £ r < R 


r 


-^-r- [ k (H^) "2 (e) + ‘]- 


36 


u°<r,S) 


V* - r) 



(0) + i l» 


*]' 


17* 


q 3 ( e) + 1 , JT < 0 < 2 


130) 
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where 


( 0 ) * slnO + cos 0 


(31) 


The functions ^ 2 ( ® ) an< * qj(0) are cubic polynomial fits used to 

7T O <jy 

ensure smoothness in regions of 0 near 0 - — , *, — , 2*. Formulae for 

q 2 < 0 ) and 03 ( 0 ) may be found in [15] • The parameter k > 0, a 

stretch factor used to perturb the surface below the conic (k * 0 ) shape is 
taken as 0.25. 


For the complete surface, we define, for e £ r R, 


u(r,0) - ^ 


u^(r, 0 ) , 


■° ( r - e 'i) • 


0 , 


0 < 9 < - 


2 < © < ^ 


3* 


u (r ,0 - ir) , n < © < 


0 / 3ir\ 3w . „ ^ - 

^ <i ^r,0 - j-J, j~<Q< 2* 


(32) 


It is expected that the mesh will be stiffest near the outer hoop 
(r » R) and around the inner radius (r =* e), For this reason we choose a 
known value of E}(r) as 

E^r) - 2t - x sin |» ~ (e < r R) (33) 

* 

where t is a constant dependent on the mesh material. The stiffness in 
the angular d.lrecion is expected to oe uniform with 


E 2 (0) = x . (34) 

From data ptovid : 8 ] for the 100-meter point design, a reasonable 

value for t (given in units /N/m ) is 

t - 3.391 ; (35) 

similarly, other parameters are calculated to be uq ■ -7.5m, e » P,235m 
and R - 50m. 
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tu,(r£,0j) is calculated by 


A 10 x 24 grid of data points 
evaluating u(r,0) at points (r£,0j) with 

r i “ e + i — 7~ (1-1,2 L r - 10) (36) 


0 j = [7.5° + (j - >) 15° ] (j - 1,2 L e « 24) . (37) 

Values of 0j correspond to data taken along every other radial cord 
truss system with reflectors assumed located on the gore edges. 
Distributed loads are obtained by substituting (32)-(j4) int^ (1) and 
evaluating f(r ,0^. 

In the examples of the identification process to oe presented, an 
equal number of linear spline basis functions are used in both r end 0 
directions. That is, M * N + 1 for an increasing sequence of N-values. 
The cubic spline approximations (24) and (25) are ueed with fixed 
Ml “ Nj ■ 4 to represent Ej(r) and E?^). The IMSL version 
(ZXSSQ) of the Levenberg-Marquardt algorithm [17 j is employed to minimize 
jM>N given by (29). For the first choice of N ? nominal (v,6) parameter 
values to initialize the Levenberg-Marquardt algorithm are obtained by 
finding those (v ? 6) coordinates which cause (24) and (25) to best approxi- 
mate assumed functions E^(r) and E^(^) chosen as guessed forms for 
¥j(r) and Is^©), respectively. For larger N, the latest previously 

obtained set of converged coordinates is used as nominal parameters. 
Nr rical calculations are performed on a CDC Cyber 170-series digital 
;L.putv>r using default values of the IMSL convergence parameters. 

Two measures of Identification scheme performance are employed. The 
quantity 


} M ’ N - 



(38) 


is used as a measure o2 state estimati>n accuracy. Additionally, 


rM.N 



x 100Z 


(39) 
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measures the relative error between the true 


E( r ,0) - E 1 (r)E 2 (G) 
and the estimatec E(r,G) denoted by 

E M ’ N (r,0) - E«« N (r)E^(e) 


(40) 


(41) 


which is calculated from (24) and (25) using the (M,N) th level of state 

approxit. rion obtained at step 8 of the computational procedure. In (39), 

i i M N 

y totes the L 2 norm on [e,R] x {0,2*]. R 9 provides a 

measure of parameter estimation accuracy. 


Convergence in the sense that 


rM.N 


-► 


0 


and 


?’ N 


-»• 


0 


as 


(M.N) * - 


depends on the ability of the cubic spline approximates (24) and (25) to 
accurately represent E^(r) and £ 2 ( 6 ). An exact pointwise fit can be 
obtained for ^(Q) by choice of the 4 6 -coefficients in (25). However, 
Ej(r) can at best be approximated to 


I E , ( r ) - E (r) | 

1.23Z 

E (r) 


relative error by (24) and (27) with M. ■ 4, Consequently, entries in the 

M N N A 

(R ’ , J ’ ) sequence can be expected to cease decreasing past some (M,N) 

value. Less realistic examples in which i24) and (25) exactly fit simpler 


V 
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Ej(r) and £2(0) functions, and and R**’** monotonically decrease 

with increasing (M,N) can be found in [IS]. Also, using the best cubic 
spline fits to E^(r) and £^(9) obtained from (24) and (25) to 
define E(r,6), along with the exact f(r,G) data, we observed that 


?’ N - 0.087 


uniformly in (M,N). The following numerical results show that the 
parameter estimates froa the identification procedure tend to iaprove 

(reduce) this J M,N value at the expense of 

Example 1 : Estimate £3(0) boiling Ej(r) fixed at the best 

cubic spline estimate of E^fr) using (24). Nominal parameters for the 
N - 4 starting value are obtained by fitting (25) to 

E^O) - 1 + | cos 0 . 

Four 6-parameters are estimated and results summarized below. 


N 

?’ N , . 

r m ‘ n , z 

CP 

time, sec 

4 

0.0390 

5.13 


8 

6 

0.0384 

5.57 


23 

8 

0.0322 

5.69 


86 

10 

0.0347 

6.01 


105 

12 

0.0330 

5.83 


132 

Essentially no 

Improvement in state 

estimate was obtained 

past N - 8. The 

e M,N(q) ter< j e< j to 3*591 instead of 

E 2 (6 ) h 3.391 

• The • 0.20 bias Is 

attributed to 

the inability A (24) 

to exactly fit 

E^(r) 

• 
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Example 2 : Estimate Ej(r) holding E 2 (©) fixed at the best 
cubic spline estimate of using (25). Nominal parameters for the 
N * 4 starting value are obtained by fitting (24) to 


Four (►-parameters are estimated and results summarized below. 


N 

J M ’ N , m 

r m ‘ n , z 

CP time, sec 

4 

0.0355 

32.25 

22 

6 

0.0343 

24.5 

41 

8 

0.0270 

4.39 

75 

10 

0.0293 

13.17 

103 

12 

0.0275 

8.08 

130 

14 

0.0273 

7.44 

168 

16 

0.0271 

7.59 

222 

18 

0.0267 

7.68 

292 

20 

0.0264 

8.03 

370 

22 

0.0260 

7.91 

460 

24 

0.0267 

8.11 

578 

26 

0.0250 

7.49 

751 

28 

0.0203 

7.58 

847 

30 

0.0259 

7.71 

1050 

From 

a state estimation 

viewpoint, N ■ 

' 28 provides the best 

accuracy. 

Overall, considering 

state, parameter 

and ease of computation. 

N =■ 8 is 

values of 

best* Figure 3 shows the character of 
N* 

E^’^(r) for selected 
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Example 3 ; Estimate both Ei<r) and E 2 (0). Nominal parameters 

are obtained as before for N ■ 4 from 

Ej(r) = 5 

E^Q) - i - j sin 0 

For each N, the first coefficient, 6j, is held fixed at its initial 
value. Seven parameters are estimated. 


N 

J M ’ N , - 

R M * N , 2 

CP time, sec 

4 

0.0356 

32.24 

40 

6 

0.0341 

28.71 

67 

8 

0.0270 

4.42 

168 

10 

0.0293 

13.18 

209 

12 

0.0275 

8.09 

256 

14 

0.0273 

7.45 

337 

16 

0.0271 

7.59 

411 

IS 

0.0267 

7.69 

490 

20 

0.0264 

8.04 

567 

22 

0.0262 

7.90 

651 

24 

0.0260 

8.12 

768 

26 

0.0260 

7.47 

945 

Again, from 

overall considerations. 

N - 8 gives 

the best results 
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VII. CONCLUDING REMARKS 


In all examples we have been able to successfully estimate the surface 
shape of the model antenna. Simili" results have been obtained where 
random noise (approximately 5% noise level) has been added to the data. 
These and other findings may be found in Section VI of [15]. 
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Figure 3.- Estimate E^r) with E 2 (6) fixed. 
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ABSTRACT 

This paper describes an on-going simple laboratory experiment, referred to 
as the Beam Control Experiment (BCE) , which has the essential features of a 
large flexible structure. The experiment is used to develop and evaluate iden- 
tification and control algorithms which look promising in the active control of 
high performance large space structures. Some' results on the maximum likeli- 
hood identification of the parameters of the beam-actuator-sensor assembly from 
experimental data is presented in the paper. 

I. INTRODUCTION 

One of the major problems in the design of control systems which operate 
in the presence of a flexible structure is obtaining accurate information about 
the plant dynamics. In particular, knowledge of the frequencies, damping ratios 
and mode shapes c the flexible modes is critical to the successful design of 
a high performance system. System identification is an iterative process, the 
success ot which depends upon the choice of algorithm and system model, the 
choice of inputs to excite the system, and the quality of output data. A care- 
ful integration of these items is especially critical in the case of large 
flexible structures. 

In this paper we describe the development and performance testing of a 
simple laboratory model of a jitter control sys un designed to provide a stable 
image with optical components mounted on a flexible structure. The study will 
be carried out in three stages: (a) identification with simulated data, (b) 
identification with real data, and (c) comparison of closed loop performance 
with simulated results. Results from the first two stages are reported in 
this paper. 

This paper is organized as follows: Section 2 describes the experimental 

set-up and a mathematical model for the BCE is developed in Section 3. A brief 
description of the maximum likelihood estimation (MLE) algorithm is presented 
in Section 4. Results on the Identification of the parameters of the BCE using 
both simulated and experimental data are discussed in Section 5. A summary 
and future work is described in Section 6. 

II. BEAM CONTROL EXPERIMENT 

The idea behind this experiment (Fig. 1) is to demonstrate the inter- 
action between the control of an optical system, symbolized by a laser beam, 
and control of a flexible structure, represented by a flexible aluminum beam 
to which passive and active mirrors are attached. These mirrors bounce the 
laser beam toward a desired target. The inteiesting control problem stems 
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from the fact that the active mirror is in fact part of a proof-mass actuator. 
Thus, any attempt to control the laser beam will tend to disturb the aluminum 
beam, thereby also disturbing the laser beam. This intricate coupling is 
quite a challenge for a classical design but more amenable to modern techniques. 
The other aspect of the experiment is the use of a commercially available 
cated microprocessor (1SI MCP-100) capable of handling at maximum a 32- e 
Kalman filter at a 2000-Hz sampling rate. Such implementation, aside f 1 its 
laboratory usefulness, bring control technology one step further toward real 
space implementation, and the experience gained is valuable. 

The schematic of the experiment is shown in Figs. 2 and 3. The laser beau 
bounces first on a mirror situated near the middle point of the vertical alumi- 
num beam, then on a mirror attached to the tip of a pivoted proof-mass actua- 
tor (PPM). The laser beam finally reaches a photodetector, which measures the 
laser beam position. A beam splitter provides a visual display of the jitter 
by projecting the spot on a remote screen. 

Two sensors are used in the experiment: the photodetector measuring the 

liae-of-sight (LOS) error, and the TPM rate sensor measuring the relative velo- 
city of the proof-mass. 

A preliminary experiment had been performed earlier (Ref. 1) using com- 
mercially available software for identification and control synthesis. Only 
one sensor was used (position) and the control system was able to signifi- 
cantly damp out the beam vibrations; thus stabilizing the line-of-sight. 

However, in order to eliminate the static error and achieve a higher perform- 
ance, a better model is needed and thus more sophisticated identification 
techniques are sought to that end. 

For purposes of identification, a known control force is applied to the 
beam using the proof-mass actuator. The control force time-history and the 
beam position and relative rate outputs are recorded on a Nicolet 4094 digital 
oscilloscope. Special software transfers the input and output data from the 
Nicolet 4094 to a Harris 800 computer where the identification algorithms 
were exercized. Thus an efficient link between hardware ter'*s and sophisticated 
computer analyses (Tig. 4) was established. 

III. MATHEMATICAL MODEL 

In this Section, a state space model of the system is developed. The 
mathematical form of this model will be used both for simulation and identifi- 
cation of the parameters of the BCE. 

The angular displacement, 0 a , of the proof-mass actuator is limited to 
to a few degrees. For small angles the force and torque applied by the actua- 
tor are given by the equations (Reference 2) 


"a 

(i) 

mb# 

(2) 


a a 
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where m is the mass of the proof-mass actuator, b is the distance from the cen- 
ter of mass to the proof-mass pivot and I is the centroid inertia of the proof- 
mass actuator. The dynamics of tie aluminum beam will be defined in terms of 
the principal modes and mode shapes. Let be the modal amplitude of the i c ^ 

mode and define: 

= angular velocity of the i t ^ 1 mode 
th 

= damping of the i mode 

- translational mode shape of the i ^ mode at the 
beam tip (where the PPM is mounted) 

and <j>f = rotational mode shape of the i th mode at this tip. Let 4> im and $± m 
be the corresponding values for che mode shapes where the mirror is mounted 
on the flexible beam. The modax equations for the beam are: 

V i + 2 ^i w i + “i *i = ■ ^i f a “ *i T i = 1,2, .. M (3) 


where M is the number of modes represented in the model. Due to the actuator 
dynamics, the control force f applied to the actuator is related to f by the 
equation 


rf = rf - (K + mgb) (6 - 0 ) - D (0 - 0 ) (4) 

a c d l at 


where 


and 


r * lever arm of the actuator 
g * acceleration due to gravity 
K - spring constant of the actuator 
D * damping constant of the actuator 
0^ * rotation of the beam tip 


The rotation of the beam tip can be expressed in terms of the modal amplitudes 

by 


M R 

0 t = I \ q i ' 


(5) 
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Equations (1) - (5) can be reduced to the set of equations 

e* = - (K + mgb)/I e - D/I 6 + «C/I E q, + D/I E q, + 4 f (6) 

» « 3 XX 11 I C 


and 


■>1 ■ 


A i ®a - C i \ - 2 5 1 “i ’l - “l ’l + *1 E *i »i + c i £ *i ii 


+ B. f (7) 

ic v J 


where 



A, = 
1 

-K (I <f>* + mb((. i )/I 

(8) 


B. = 
1 

r/K 

(9) 

and 

C, = 
1 

A t D/K 

(10) 


Let yi be the displacement of the laser beam from the reference point \i,e., 
this is a measurement of the L.O.S. error). Let y 2 be the relative angular 
rate between the actuator and the beam tip. Then, 


y l ‘ 2 t ’i - (I 1 + l 2> 1 »lm + V. ] 

and 

y 2 = K - K m K - 1 q i 


(id 


( 12 ) 


where is the distance between the mirror on the beam and the mirror on the 
proof-mass actuator and ^2 *- s the distance between the photodetector and the 
mirror on the proof-mass actuator (See Fig. 3) . 

Equations 6, 7, 11, and 12 give a state space representation of the 
input/output behavior of the BCE with T0 a 0 a ^1 ^1 • • • ^mI as the 
state vector. , 

For a single mode model the equations are given by 


x * Fx + Gu (13) 

y * hx (14) 
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where 


and 


where 


Also, define 



0 

1 0 

0 

F = 

2 

~ui a 

-D/I a) 2 

z T 1 

D/I 


0 

0 0 

1 


" A 1 

- C 1 

c i*j - 2 'i“i 

G = 

1 1 

0 

r/I Ob/] 

f 

! 


h » 2 0 

2 ^lm ( *1 + V 

0 

L 0 i 

0 

"+1 i 


Ug = (K + mgb) /I . 


2/l(K-Hngb) 


(15) 


(16) 

(17) 

(18) 
09) 


£ 

The F, G and H matrices depend on the unknown parameters ( w a ,D,a>i,ei,4>i»4>]., 

♦ and the kn^wn parameters (l,m,b,r,£i and £2 ^ • The values of the 

known parameters are tabulated in Table 1* In this model the number of unknown 
parameters is equal to (6*H-2) where M is the numoer of modes. 


I 

4X1 0" 4 Kg-m 2 

m 

0.07895 Kg 

b 

0.06985 m 

r 

0.021 m 

*1 

0.165 m 

£„ 

0.217 m 

2 



Table 1. Known Parameters of the BCE 
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IV. MAXIMUM LIKELIHOOD ESTIMATION (MLE) 


There are several methods available for the estimation/ identification of 
parameters Figure 6 shows the main components of an identification method. 
The system to be identified and a mathematical model, M(p) , of the system 
is excited by a known input u. An error function, L(p,e), is formed from the 
outputs of the system and the model. Identification is the process of 
adjusting the model parameters p to minimize the error function. The choice 
of M(p) , L(p,e) and the adjusting mechanism foi p lead to different identifi- 
cation algorithms. In this paper, we shall restrict our attention to the 
maximum likelihood estimation of parameters. 

The MLE can be applied to a large class of problems and has good statis- 
tical convergence and accuracy properties. In addition, MLE is well suited 
for identifying physical parameters of the system. This is a drawback with 
most recursive algorithms. The main disadvantage of MLE is the amount of 
computation. However, the amount of computation can be reduced significantly 
by taking into account the special features of the dynamics of the large space 
structures . 

The flow of computation in the MLE is shown in Fig. 7. The mathematical 
model for the system is assumed to be 


x = F(p) x + G(p)u + (u 

(20) 

y = H(p) x + v 

(21) 


where x is the n-dimensional state vector, y the p-dimensional output vector 
and u the m-dimensional input vector, w and represent the process and 
measurement noise respectively. The matrices F, G and H depend on p, the vector 
of unknown system parameters. An input signal £u(t) , o < t < tjQ has been 
applied to the system and the output y of the system has been observed at dis- 
crete times t Q , tj, . . . t^. Further, it is assumed that x(o) and w are 
zero-mean gaussian random variables with 


cov 

(x(o)) 

= P(o) 

(22) 

cov 

(U>) 

- Q 

(23) 

and cov 

(V) 

- R 

(24) 

The identification problem 
experimental data u(t^) , 
y the output estimate, and 

consists 
y(ti) , 
e(t R ) be 

of estimating the parameters 
i *“ 1,2,...N. Let x be the 
the outout error 

p from the 
state estimate 


where = y ^k^ ~ Yv^)* ( 2 ^) 
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The negative log likelihood function, v(p,e), can be written as 

N t -1 

v (p,e) = - log L(p,e) = T. e(i) E(i) e(i) + log j B<i) j (26) 

i=l 

The maximum likelihood estimate of the parameters p is obtained by maxi- 
mizing the likelihood function L(p,e) (or by minimizing V(p,e)). This non- 
linear minimization problem has to be solved by numerical methods and makes the 
MLE computationally intensive. The computational aspects of MLE are discussed 
in Reference 4. 

V. IDENTIFICATION RESULTS 

Numerical results on the identit ication of the parameters will be pre- 
sented in two steps. First results from the identification of simulated data 
will be shown. This will be followed by results from experimental data. 

a. Simulation Results : Identif icaticn with simulated data was done 

to get a better understanding of the dynamics of the BCE, to pro- 
vide guidelines to set up the experiment and to test the MLE 
software. The system was simulated using 4 modes and was excited 
by a "bang-bang" type input with an amplitude of ±0.1 Newton. 

Figure 8 shows the laser beam position and relative velocity 
output from the simulation. This input/output simulated data 
was used to identify a single mode beam model of the system 

(see equations 13-17). The negative log likelihood function 
V(p,e) was probed at a few points to see its variation with param- 
eter p. Figure 9 shows the variation of the likelihood surface 
with w a and . The surface is well-behaved in these two variables. 
The damping terms £ a and were set to thg simulation values and 
only parameters (w a . and ^i m ) were allowed to 

vary. Table 2 shows the results based on simulated data. There 
is good agreement between simulated and estimated values. Now we 
are ready to try the identif ication with experimental data. 

b. Experimental Result s: The aluminum beam was excited by applying 

to the proof-mass actuator a sinusoidal force with a linearly 
varying frequency (so-called "chirp* 1 excitation) . Figure 10 shows 
the control force f c . The position and rate measurements are shown 
in Figure 11. As before, the single mode model will be used as a 
starting point for the identification of parameters (w a , D, 

$ 1 , <p . Initially, the MLE had convergence problems. 

These were related to one or more of the following causes: 1) large 

differences between the values of some of the actual BCE parameters 
and those of the original simulation, 2) bias in the input force 
and position measurements, and 3) error in rate measurement cali- 
bration. 
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The bias was accounted for simply by subtracting a constant from 
the force input and position output. The error in rate calibra- 
tion was taken into account by defining a scale factor a. This 
results in a new H matrix where 


2l 2 

0 

2( *i» - - < £ l + *2> *L> 

- 

0 

0 

a 

0 

R 


The scale facto^ was estimated along with the other 8 parame- 
ters. The estimated values are shown in Table 3,. The model 
parameters shown in Table 3 for "simulation values" were obtained 
from an extremely simplified model of the aluminum beam (canti- 
levered with a point mass at tip). Thus, it is not surprising 
that some of the values obtained from the identification are very 
different. In particular, values of rotational mode shapes are 
quite sensitive to local inertias and masses. 


H 

Simulated 

Data 

Estimate 

(Simulation 

Data) 

Estimate 

(Experimental 

Data) 

U) 

a 

46.36 

45.5 

37.18 

w i 

54.63 

54.58 

40.32 

C a 

0.005 

.005* 

85E-4 

5 1 

0.005 

.005* 

0.01 

♦l 

3.328 

3.49 

7.48 


-13.51 

-13.61 

-110.0 

♦i. 

1.068 

0.98 

21.0 


-10.6 

-10.3 

2.49 

a 

1.0 

1.0* 

0.80 





^These parameters were set to their simulation values. 


Table 3 Results with Experimental Data 
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figure 11s! 'vs a comparison between the measured c.nd estimated values of the 
outputs. The estimated values were generated for the pramcler set which 
resulted from identification using experimental data (Table 3). Figure 11a 
•hows the measured and estimated values of the position. Figure lib is a 
\ _ow-up of the same curve to show the difference between measured and estimated 
ilues. Figures 11c and lid show curves similar to 11a and lib for the rate 
easurement. There is very good agreement between the model output and the 
experimental data. 

This model will be validated by taking the direct approach. In the third 
stage of the experiment, a control system will be designed using the identi- 
fied model. The predicted behavior of the control system will be compared 
with its experimental behavior. 

VI. CONCLUSIONS AND SUMMARY 

In this paper, we have described a laboratory experiment which has the 
salient features of controlling an optical system located on a flexible struc- 
ture. The experiment will be used as a test bed for designing control and 
identification algorithms for large space structures. The parameters of a 
.Model suitable for designing a control system were identified using maximum 
likelihood estimation. The real test of a model is of course how well ic 
satisfies the goal of modelling. Currently, we are designing a control 
system based on this model and the results of this final stage will be 
reported in another paper. 
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FIG, 10 BEAM EXCITATION INPUT 


296 










N85-312 12 
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FOR MODAL PARAMETER IDENTIFICATION 
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ABSTRACT 


A method, called the Eigensystem Realization Algorithm ( EF 5 developed 
for modal parameter identification and model reduction of dynamic systems from 
test data. A new approach is introduced in conjunction with the singular value 
decomposition technique to derive the basic formulation of minimum order 
realization which is an extended version of the Ho-Kalman algorithm. The basic 
^ormulation is then transformed into modal space for modal parameter Identifica- 
tion. Two accuracy indicators are developed to quantitatively Identify the 
system modes and noise modes. For illustration of the algorithm, examples are 
shown using simulation data and experimentaldata for a rectangular grid struc- 
ture. 


I. INTRODUCTION 

The state space model has received considerable attention for system 
analyses and design In recent control and systems research programs One of 
these areas. In particular. Is control of large space structures. In order to 
design controls for a dynamic system It Is necessary to have a mathematical 
model which will adequately describe the system's motion. The process of 
constructing a state space representation from experimental data Is called 
system realization. 

During the past two decades, numerous algorithms for the construction of 
state space representations of linear systems have appeared In the control 
literature. Among the first were the works of Gilbert tl] and Kalman [2], 
introducing the important principles of realization theory In terms of the 
concepts of controllability and observability. Both techniques use the transfer 
function matrix to solve the realization problem. Ho and Kalman [3] approached 
this problem from a new viewpoint. They showed that the minimum realization 
problem is equivalent to a representation problem Involving a sequence of real 
matrices known as Markov parameters (pulse response functions). By nlnlmum 
realization is meant the smallest state-space dimension among systems realized 
that has the same input-output relations within a specified degree of accuracy. 
Questions regarding the minimum realization from various types of Input-output 
data and the generation of minimum partial realization are studied by Tether 
[4], Silverman [5], and Rossen and Lapldus [ 6 ] using Markov ^■'’-neters. Rossen 
and Lapldus [7] successfully applied Ho-Kalman [3] and Tether [4] methods to 
chemical engineering systems. A common weakness of the above schemes Is that 
effects of noise on the data analysis were not evaluated. Zelger and McEwen 
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[8] proposed a combinition of the Ho-Kalman algorithm [3] with the singular 
value decomposition technique for the treatment of noisy date However, no 
theoretical or numerical studies were reported in Reference [8 1 vnong follow- 
up developments along similar lines, Kung [9] presented anc algorithm in 
conjunction with the singular value decomposition technique V'. /ncorporate the 
presence of the noise. Note that the singular value decoir.x^ition technique 
[10-11] has been widely recognized as being very effective and numerically 
stable. Although several techniques of minimum realization are available in the 
literature, formal direct application to the modal parameter identification for 
flexible structures was not yet addressed. 

In the structures fieiu, the finite-element technique is used almost 
exclusively for constructing analytical models. This approach is well estab- 
lished and normally provides a model accurate enough for structural design 
purposes. Once the structure is built, static and dynamic tests are performed. 
These tes. results are used to refine the finite-element model, which is then 
use for-afinal analyses. This traditional approach to analytical model devel- 
opment may not be accurate enough for use in designing a vibration control 
system for flexible structures. Another approach is to realize a model directly 
from the experimental results. This requires the construction of a minimum- 
order model from the test data that characterizes the dynamics of the system 
at the selected control and measurement positions. The present state-of-the-art 
in structural modal testing and data analysis is one of controversy about the 
best technique to use. Classical test techniques, which may provide only good 
frequency and moderate mode shape accuracy, are often considered adequate for 
finite-element model verification purposes. On the other hand, advanced data 
analysis techniques which offer significant reductions in test time and Improved 
accuracy , have been available [12-16] but are not yet fully used, for example, 
Ibrahim [13] presented a method based on state space for the direct identifica- 
tion of modal parameters from free responses. Recently, Void and Russell [16] 
presented a method using frequency response functions and time domain analysis 
which can also identify repeated eigenvalues. A comparison of contemporary 
methods using data from tne Galileo spacecraft test Is provided by Chen [17]. 

Although structural dynamics techniques are generally successful for ground 
data, further incorporation with work from t.he controls discipline is needed to 
solve modal parameter identification/control problem. For example, it is known 
from control theory [18] that a system with repeated eigenvalues and Independent 
mode shapes is not identifiable by single input and single ouput. Methods 
which allows only one Initial condition (Input) at a time [13], will miss 
repeated eigenvalues. Also, if the realized system is not of a minimum order 
and matrix inversion is used for constructing an oversized state matrix, 
numerical errors may become dominant. 

Under the interaction of structure and control disciplines, the objective 
of this paper is to introduce an Eigensyscem Realization Algorithm ( ERA ) for 
modal parameter identifica <on and model reduction for dynanneal systems from 
test data. The algorithm consists of two major parts, namely, basic formulation 
of the minimum order realization and modal parameter identification. In the 
section of basic formulation, the Hankel matrix which represents the data 
structure for Ho-Kalman algorithm is generalized to allow random distribution 
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of Markov parameters generated by free decay reponses. A unique approach 
based on this generalized Hankel matrix is developed to extend the Ho-Kalman 
algorithm in combination with the singular value decomposition technique [10- 
11] . Through the use of the generalized Hankel matrix, a linear model is realized 
for dynamical system matching the input and output relationship. The realized 
system model is then transformed into modal space for modal parameter identifi- 
cations. As part of ERA, two accuracy indicators, namely, the modal amplitude 
coherence and the modal phase col linearity, are developed to quantify the 
system modes and noise modes. The degree of modal excitation and observation 
are evaluated. The ERA method thus forms the basis for a rational choice of 
model size determined by the singular values and accuracy indicators. 

Two examples are given to illustrate the ERA method. The first example 
uses simulated data from an assured structure. The effect of repeated eigenvalues 
on the parameter identification is shown. The second example uses experimental 
data from \ simple grid structure. Comparison of the ERA results with a finite 
element model of the grid is performed. Experimental results for a more complex 
structure— the Galileo spacecraft—are shown in Ref. [19]. 


II. BASIC FORMULATIONS 


A finite-dimensional, discrete time, linear, time-invariant dynamical 
system has the state-variable equations 


x(k+l) * Ax(k) + Bu(k) 

(1) 

y(k, = Cx(k) 

(2) 


where x is an n dimensional state vector, u is an m dimensional control input, 
and y is an p dimensional output or measurement vector. The integer k is the 
sample indicator. The transition matrix A characterizes the dynamics of the 
system. For flexible structures, it is a representation of mass, stiffness 
and damping properties. The problem of system realization is then the follow- 
ing. Given the measurement functions y(k), construct constant matrices [A, B, 
C] such that the functions y are reproduced by the state-variable equations. 
With different sets of inputs and outputs, several cases can be obtained. The 
simplest case, namely, single input and single output, is treated first to 
allow the reader familiar with notations for the treatment of multi-input and 
multi -output cases. 

Single input and single output 

For the system (1) with free pulse-response (or initial-state-response), 
the time-domain description is given by the function known as Markov parameter 

y(k) - CA k "lB [or y(k) » CAkx(O)] (3) 

where x(0) is the system ini tie' onditions and k is an integer. Note that the 
matrix B is a column vector (si .gie input) whereas the matrix C is a row vector 
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(single output). For free init1al-state>respon$e, the matrix B only represents 
the information of initial conditions rather than the control influence matrix 
as shown in Eq.(l). The problem of system realization is to construct matrices 
[A, B, C] in terms of the measurement function y(k) such that the identities of 
Eq.(3) hold. Now observe that 


where 


y(k) * VA k- lB [ or y(k) « VA k x(0) ] 
y(k) 



ryOO 1 


C ' 

* 

y(k+i) 

• 

; and V = 

CA 

• 


• 

_y(k+n-l)_ 


• 

CAd-1 


(4) 

(5) 


Assume that this nth order system has no repeated eigenvalues. There exists 
a row vector C from observability theory (Ref. 18) such that V has rank n. 
Consequently, rearranging Eq. (4) becomes 

y(k+l ) = VA*B * VAV^y(k) (6) 


Given the sequence of measurement vectors y(k+l), 
matrix H(k) is defined as 


H(k-l) * Cy(k) t y(k+l) 


y(k+n)] 


fy(k) M™) 

y(k+l) ,y(k+2) 


the general i zed 

..... y(k+n-l)l 
..... y(k+n) 


Hankel 


(7) 


[y(k+n-l) ,y(k+n) y(k+2n-2)j 


It immediately follows from Eq.(6) that 

H(k) = VAV'^K(k-l) » VA k V“ 1 M(0) 


( 8 ) 


or from Eq. (4) that 

H(k) « VA^[B. AB..... A^B] - VA*W 


( 9 ) 


where W is a controllability matrix (Ref. 18). Again if the system with order 
.! has no repeated eigenvalues, there exists a column vector B such that U has 
rank n. This means that H(k) is invertible if the system Is controllable and 
observable, letting k * 1. Eq. (8) will then determine the state matrix A In 
the following way 

VAV- 1 «H(1)H-1(0) (10) 

To rigorously prove this result, define E as the column vector E t *[ 1 ,0,...,0]. 
The measurement function y(k+l) can then be written by 

y(k+l) » ETH(k)E - ETH(k)H (0)H(0)E » ET[H(1)H (0)] k H(0)E (11) 

with the aid of Eqs. (8) and (10). Hence by Eq. (3), the triple [H(l)H“ l (0), 
H(0)E, ET] Is a realization in the sense that if the triple [A, B, C] in the 
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system equations (1) an'" (2) is replaced by the [H(1)H*1(0) , H(0)E, E T ], the 
measurement functions y(k) are reproduced as proved in Eq. (11). In other 
words, state variable equations (1) and (2) are transformed to the following 
equations 


x(k+l) « H(l)H-l(0)x(k) + H(0)Eu(k) 

(12) 

y(k) - E^k) 

(13) 

where 7(k) = V“!x(k). 

Let us summarize the case as follows. 

(14) 


A finite-dimensional, discrete time, linear time invariant dynamical system 
with a single input and a single output is realizable if the state matrix A has 
no repeated eigenvalues, and the system is controllable and observable. 

Hulti-input and Multi-output 

The time-domain description for this case is given by the pulse-response 
(or initial-state-reponse) function known as Markov parameter 

Y(k) * CA^B (or Y(k) - CA k [ Xl (0),x 2 (0) x m (0)]) (15) 

where xj(0) represents the ith set of initial condition and k is an integer. 
Note that B is a nxm matrix and C is a pxn matrix. The problem of system 
realization is that, given the functions Y(k), construct constant matrices [A, 
B, C] in terms of Y(k) such that the identities of Eq. (15) hold. The algorithm 
begins by forming the rxs block matrix (generalized Hankel matrix) 


H rs ( k— 1) 


rv(k) 

YUl+k) 


,Y(k+ti) 

.YUl+k+tx) 


Y(k+t s .i) 

, . . . ,Y( j^+k+tg_j ) 


|Y ( J r-1*^) (j r-l +lc+t l) • • • • (jr-l^^s-1 )J 


(16) 


where j-j (i *1 , . . . ,r-l ) and ti(i«l,...,s-l) are arbitrary integers. For the 
system with initial-state-response measurements, simply replace H rs (k-1) by 
H rs (k) % If Is easy to prove from Eq. (15) that Eq. (9) also holds for 
this multi-input and multi-output case. 


H rs (k) » ; V r 


CA 


and We - [B,A tr B A^B] 


CA Jr *l 


(17) 


where V r and W s are respectively the observability and controllability mai~ices 
in a general sense. Note that V r and W s are rectangular matrices with dimensions 
rp x n and n x ms respectively. Assume that there exist a matrix H* satisfying 
the relation 
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- I„ (14) 

where In is an identity matrix of order n. Define 0 p as a null matrix wlt'.i 

order p, E 1 = [Ip, 0 p ,..., 0 p ] and eJ » [I., fly...., Oy]. In view of Eqs. 

(16) and (18), the measurement function Y(k+1) can be obtained through either 
of two algorithms A1 and A2. Ihe algorithm A1 is 

Y(k*l) • EpH rs (k)E, • Ej* r » k H s H»V ( K s £, 

' E p T CY r *W s H*] k V r W s E,, 

• Ej[H rs (l)H*] k H,. s (0)^ 1 <’») 

and the algorithm A2 is 

Wl) - E^OOE, - 

■ EpV$C H,v r w sl k E. 

* EjH rs (0)[H»H rs (l)] k E. (20) 

fence, by Eq.(16), H„(0)E,, eJ] or [H*H«0). E». EjH rs (0)] 

is a realization. There fs no doubt that the matrix H* plays a major role in 
solutions (19) and (20). What Is H*? Observe that, from Eqs. (17) and (18), 

HrstOj^H^O) - - V r W s - H r$ (0) (21) 

H* Is a pseudo-inverse of H rs (0) in a general sense. When the rank of H rs (0) 

equals to the column number of H rs (0), then H**[[H rs (0)]^H rs (0)]“*[H rs (0)3*. 

If the rank equals to the row number, then H*<H rs (0)] T [H rs (0)[H rs (0)]*J"*. 
The matrix H rs (l)H* has been used In structural dynamics area to identify 
system modes and frequencies. 13 Both are special cases representing either 
single input or single output which can not realize a system that has repeated 
eigenvalues, or a noise-free system unless the system order Is a priori known. 
A general solution for H* is given below. 

For an nth order system, find the nonsingular matrices P and Q such 
that™. 11 

H fS (0) * PDQ T (22) 

where tho rpxn matrix P and the nxms matrix Q 1 are isometric matrices (all the 
columns are orthonormal), leaving the singular values of H rs (0) In the diagonal 
matrix D with positive elements [d^^f ••••dn]. The rank n of H rs (0) is 

determined by testing the singular values for zero (relative to desired 
accuracy) 1 ? which will be described In the next section. Define 

H rs (0)- PDQ T - [PD][Q T ] - P d Q T (23) 
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Each of the four matrices [pJ,Q*,Wc,vI] has rank and row number n. By 
Eq.(17) with k=0, 

v r w 5 = H rs (0) • P a q T (24) 

Multiplying on the left by P d and solving for Q* yields 

™s s < p d p d)’ lp d V r w s = Q T (25) 

T is nonsingular because if U * W S Q(Q T Q)”^* W $ Q, then TU* I by Eq. (25). Since 
TU = I = U T for nonsingular T ar.d *J then 

« s CQ(PjP d )-'pJ]V r ■ I„ (26) 


u eiKe» by Eq. (18) 

H # = [Q][(PjP d ) _1 Pj] = CQKtT 1 ? 1 ] * QP{ (27) 

The dimension of matrices Q and P^ with rank n are respectively msxn and 
nxrp. To this end, summarize the case as follows. 


A finite-dimensional , discrete time, linear time-invariant dynamical system 
with multi-input and multi-output is realizable in terms of the measurement 
function if the system is controllable and observable. 

Note that no restrictions on system eigenvalues are given for this case. 
In other words, this technique can realize a system with repeated eigenvalues. 
The system (1) with this realization will be transformed into the following 


equation 

x(k+l) * H rs (l)H*x(k) + H rs (0)E m u (28) 

y(k) * EjiT (k) (29) 

where x(k) = V s H*x(Y). Or (30) 

7(k+l) * H # H rs (l)7(k) + gi (31) 

y(k) * EpH rs (0)7(k) (32) 

where 7(k) = W s x(k) (33) 


The realizations (28)- (33) are not of minumum order, since the dimension of x 
is the number of either columns or rows of the matrix H rs (0) which is larger 
than the order n of the state matrix A for multi-input and multi-output cases. 

With the aid of Eqs.(17), (18) and (27), a minimum order of realization 
can be obtained from 
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Y(M) * EpH r s (k)E » * 

“ S p V i "'VV«r“s^ 

* E;H rs (0)«P5v r * k M s qpjH rs (0)t 1 , 

* EjH rs (0)Q[pjH rs (l)Q] k p5H rs (0)E m 

■ EpPdtP'nrst 1 >03 k Q T E„, (34) 

where Eq.(23) has been used to obtain the last equality. This is the basic 
formulation of realization for ERA. 

The triple [Pd H rs U)Q» Q T Em» E o p d^ * s a min ^ nw,n realization, since the 
order n of P*H rs (l)Q equals to the dimension of the state vector x. The same 
solution in a different form for the case where - tj > 1 (1«l,...,r-l) can 
be obtained by completely different approach as shown in Refs. [3 A 20]. The 
system (1) with this realization is written as 


x(k+l) * pjH rs (l)gx(k) + Q^u (35) 

y(k) - E T P d 7(k) (36) 

where 

x(k) = W s Qx(k) (37) 


A simple exercise such as replacing Y(k+1) by Y(k) in Eqs.(19), (20) and (34) 
shows that all the algorithms developed above are also true for the realization 
of a system with initial-state-response. 


Examination of Eqs. (19), (20) and (34) reveals that algorithms (Al) and 
(A2) are special cases of ERA. Al Is formulated by inserting the Identity 
matrix (18) into the right hand side of the state matrix A as shown In Eq. 
(19). On the other hand, A2 Is obtained by inserting the Identify matrix (18) 
into the left hand side of the state matrix A as shown in Eq. (20). However 
the algorithm ERA is formed by inserting the Identity matrix (18) Into both 
sides of the state matrix A as shown in Eq. (34). Because of the different 
insertion, Al and A2 do not minimize the order of the state transition matrix. 
Mathematically, if the singular val 3 decomposition technique is not included 
in the computational procedures, Al and A2 can not be numerically Implemented, 
unless a certain degree of artificial noise and/or system noise are present. 
Noises tend to make up the rank deficiency of the generalized Hankel matrix 
H rs (0) for algorithms Al and A2. Since the degree of noise presence is generally 
unknown, algorithrs Al and A2 are not recommended. 


III. MODAL PARAMETER IDENTIFICATION AND MODEL REDUCTION 


The presence of almost unavoidable noise and structural nonlinearity 
Introduces uncertainty about the rank of the generalized Hankel matrix and, 
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hence, about the dimension of resulting realization. By employing the singular 
value decomposition (SVD) technique, the rank structure of the Hankel matrix 
can be quantitatively displayed. Ihe set of singular values can be used to 
judge the distance of the matrix with determined order to a lower-order one. 
Therefore, the structure of the generalized Hankel matrix can be properly 
exploited to efficiently solve the realization problem. These include an 
excellent numerical performance, stability of the realization and flexibility 
in determining order-error tradeoff. 

Assume that, by Eq. (22) 

0® Diag. [dj, d2,...,d^,dp^i ,..., d|j] (38) 

with 

d H ^2 > ..* 1 d n >_ d n +l >,...>. dN (39) 

If the matrix H rs (0) has a rank n then all the singular values dj(i®n+l N) 

should be zero. When singular values di(i*n+l,...,N) are not exactly zero but 
very small, then one can easily recognize that the matrix H rs (0) is not far 
away from a n-rank matrix. However, there can be real difficulties in deter- 
mining a gap between the computed last nonzero singular value and what should 
be effectively considered zero, when measurement noise is present. Possible 
sources of the noise can be attributed to the measurement signal, computer 
round-off and Instrument imperfections. 

book at the singular value d n of the matrix H rs (0). Choose a number 6 
based on measurement errors Incurred in estimating the elements of H rs (0) and/or 
round-off errors incurred in a previous computation to get them. If 6 is 
chosen as "zero threshold" such that 6 < d n , then the matrix H rs (0) is 
considered to have rank n. Unless information about the certainty of the 
measurement data are given, the number 6 is defined as a function of the 
precision limit in the computer machine. For example, 6 *d n /di cannot exceed 
the precision limit, further details are found in Ref. [11]. 

After the test of sir >lar values, assume that the matrix [P^H rs (k)Q] has 
rank n. Find the eigenvalues c and eigenvectors i such that 

V’kPjHrsWQ^ a z (*0) 

The modal damping rates and damped natural frequencies are simply the real and 
imaginary parts of s, after transformation from the z- to the s- plane using 
the relationship 

s - C(lnz) + 2j* ]/(kAx) (41) 

where at is the data sampling interval and j Is an integer. Note that k is 
generally chosen as 1 for simplicity. Although z and are in complex domain, 
computation of Eq.(40) can be performed in the real domain (Ref. 21) sirce the 
state matrix realized for most flexible structures has Independent eigenvectors. 

The triple [ z, ♦” 1 Q T E JB , Ej[P d + ] Js obviously a minimum order of realiza- 
tion simaly by observing Eq. (3$). eIp^ Is called sensor modal displacements 
and +“'Q'E m initial modal amplitudes. To quantify the system modes and noise 
modes, two indicators are developed as follows. 


307 



Modal Amplitude Coherence y 


If the information about the uncertainties of the measurement Is minimum, 
the rank thus determined by the SVD becomes larger than the number of excited 
and observed system modes to represent the presence of noises In modal space. 
In modal parameter Identification, the Indicator referred to as modal amplitude 
coherence Is developed to quantitatively distinguish the system and noise 
modes. Based on the accuracy parameter, the degree of the modal excitation 
(controllability) Is estimated. 

Ihe modal amplitude coherence Is done by calculating the coherence between 
each modal amplitude history and an Idea one formed by extrapolating the Initial 
value of the history to latter points using the Identified eigenvalue, let 
the control input matrix (Initial condition) be expressed 

♦‘ 1 Q T E m - C^.bj, b n ]*, (42) 

where * means transpose and complex conjugate, and the 1 x m column vector bj 
corresponds to the system eigenvalue Sj( j«^,...,n) . Consider the sequence 

<ij = [bJ.expltjATSjJbJ exp(t s _ 1 AxSj)bj] (43) 

which represents the ideal modal amplitude In complex domain containing 
informations of the magnitude and phase angle with time step At . Now, define 
vectors qj such that 

♦ -1 Q T * Cqi»q2 q n 3* ( 44 ) 

The complex vector qj represents the modal amplitude time history from the real 
measurement data obtained by the decomposition of „ne Hankel matrix, let Vj 
be defined as the coherence parameter for the jth mode, satisfying the relation 

Y j * l^j<ljl/(lflj<ljllflj <, jl) 1/2 < 45 ) 

where | | represents the absolute_value. The parameter yj takes only the values 
between 0 and 1 . yj ♦ 1 as qj ♦ qj Indicates that the realized system 
eigenvalue Sj and the initial modal amplitude bj are very close to the true 
values for the jth mode of the system. On tne other hand. If yj Is far 
away from the value 1, the jth mode Is a noise mode. However, to make a clear 
cut between the system modes and noises requires further studies. Obviously, 
the parameter yj quantifies the degree to which the modes were excited by 
a specific Input, l.e. the degree of controllability. 

+ 

Modal Phase Collinearlty u 

lightly damped structure, normal mode behavior should be observed. An 
Indicator referred to as the modal phase collinearlty Is developed to measure 
the strength of linear functional relationship between the real part and the 
Imaginary part of the sensor modal displacement (mode shape) for each mode. 
Based on the accuracy Indicator, the degree of the modal observation Is 
estimated. Define 
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Ep^ ~ [Cj jCg , • . • »Cp3 (46) 

where Cj(j=l,2,...,n) is the sensor modal displacement corresponding to the 
jth realized mode, let the column vector 1 of order p be 

1 T » CM 1] (47) 

in which p is the number of sensors. Now compute the following quantities 
for the jth mode shape. 

Cj = cjl/p (48) 

c rr = [Rea1(Cj-Cjl)] T [Rea1(Cj-Cjl)] (49) 

c ri = [Real(Cj-Cjl)] T [Imag(Cj-cjl)] (50) 

c^ = [Imag(Cj-Cjl)] T [Imag(Cj-Cjl)] (51) 

e = (cn - c rr )/2c ri (52) 

© * arctan[e + sgn(e)(l + e 2 ) 1 / 2 ] (53) 

where Real( ) and Imag( ) respectively arc the real part and Imaginary part of 
the complex vector ( ), and sgn( ) is vne sign of the scalar ( ). The modal 
phase collinearity yj for the jth mode Is then defined as (Ref .22) 

Uj = f c rr + c ri [2(e 2 +l)sin 2 (e)-l]/e }/(c rr +c 1i ) ; j«l,2,...,n (54) 


This indicator checks the deviation from 0° - 180° behavior for components of 
jth identified sensor modal displacement. The parameter uj takes only the 
values between 0 and 1. pj ♦ 1 indicates that the accuracy of the modal 
displacement is high. On the other hand, if yj Is away from 1, the jth mode 
is either a noise mode or high damping Is present. 

Model Reduction 


The dynamical system is composed of an interconnection of all the ERA 
identified modes. The accuracy Indicators allow one to determine the degree of 
individual mode participation. Model reduction can then be made by truncating 
all the modes with low accuracy indicators. The accuracy of the complete modal 
decomposition process can be examined by comparing a reconstruction of Y(k) 
formed by Eq.(35) with the orginal free decay responses, using the reduced 
model . 


IV. SUMMARY OF ERA 


A flowchart of the procedures to be followed to use ERA In system model 
identification is presented in figure 1. The computational steps are summarized 
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as follows: 


1. Construct a block -Hankel matrix H rs (0) by arranging the measurement 

data into its rows with given r, s, t< (1*1,2 s-1) and (i * 1, 2 

r-1), (Eq. 16). 

2. Decompose H rs (0) using singular value decomposition (Eq. 23). 

3. Determine the order of the system by examining the singular values of 
the Hankel matrix H rs (0) (Eq. 38). 

4. Construct a minimum-order realization (A, B, C) using a shifted block- 
Hankel matrix (Eq. 34). 

5. Find eigensolutions of the realized state matrix (Eq. 40) and compute 
the modal damping rates and frequencies. (Eq. 41). 

6. Calculate the coherence parameter (Eq. 45) and the collinearlity 
parameter (Eq. 54) to quantify system modes and noise modes. 

7. Determine the reduced system model based on accuracy Indicators, 
reconstruct function Y(k) (Eq. 35) and compare with measurement data. 

Note that the determination of r, s, ti and In Step 1 above requires 
further development. This determination Is related to the choice of the 
measurement data to minimize the size of the Hankel matrix H rs (0) with the rank 
unchanged. 


V. EXAMPLES: SIMULATION AND EXPERIMENT RESULTS 


To illustrate the ERA method, two examples are given. First, a numerical 
problem will be posed and solved for an assuned structure with distinct and 
repeated frequencies. Second, experimental data for a simple, two-dimensional, 
grid structure as shown In Fig. 2 Is used and realized In terms of a linear 
system. Experimental results are compared with those predicted by a finite 
element model. 

Numerical Simulation 

Figure 2 shows a representation of a typical flexible structure. The 
dynamical equation for this typical structure with Initial-state-response in 


terms of system modes In modal space can be written as: 

dg/dt - A g (55) 

y ■ T g (56) 

where A is a canonical matrix with the diagonal blocks {A] A^}, 
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g is the generalized modal amplitude and C is the generalized sensor influence 
matrix. The quasi -diagonal matrix Aj (j*l, — ,k) has the matrix form 


A j 



(57) 


The complex values 6j + Iwj are the eigenvalue of the frame structure. 

Given a model described as in Eq. (55), results of some numerical simulation 
using the ERA scheme can be summarized in the sequel. Two cases will be given 
including systems with and without repeated eigenvalues. The numerical test is 
performed by taking as “data" y the output values of the solution of a model 
with the form (55) whose parameters A, C and initial condition g(t 0 ) are 
known. In the analysis of physical systems, experimental methods generate the 
measurement data y. It is then desired to realize a system by using the data y 
and compare with the known model. 


Case I: A model with distinct eigenvalues 

Assume that parameters such as bending rigidity, mass density and damping 
coefficient of the assumed structure are adjusted to give 


A j 8 


-O.Olxj j 

-j -O.Olxj 


j 


2, 3, 4, 5 


(58) 


To Illustrate applications of ERA in a single input and single output case, A 
sensor is chosen and located to give 

C = [l.O.i.O, 1,0,1»0,1,0] (59) 


Let the initial condition for free decay responses be 

g T (t 0 ) = [0,1,0. 1,0, 1.0, 1,0,1] (60) 

Then the functions y with a sample time Interval 0.05 second generated 
from the model (55) with known parameters (58), (59) and (60) are used as 
measurement data for the ERA procedure. 

Using = tj = i and r=s=90 in Eq. 16, the ERA realization of a dynamical 
system is 

C = [0.709,2.529,-0.347,-1.706,0.814,-1.183,-1.382,-0.276,1.129,1.257] (61) 

g T (t o )=[0. 103 ,0.367, -0.1 14, -0.563, 0.395,-0.574, -0.696, -0.139, 0.396, 0.440](62) 

and A is identical to that shown in £q. (58) with the accuracy close to the 
precision limit of the computer. In the process .af realization, the number 
6 *d n /d 1 as defined in Eq. (38) is set to be 10~'S The singular values of 
the generalized Hankel matrix H rs (0) are 
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D = [49.86,44.84,33.69,27 .64,23.69,21 .04,1 3.57,10.95,6.374.5.508] (63) 

All the values dj (j -11,..., 90) which has the number dj/d^ less than 10"^ 
are considered to be zero. The rank of the Hankel matrix J K rs (0) is obviously 
ten which is identical to the order a priori given in Eq.(58). The realized 
state matrix is a minimum order of 10 and the eigensolutions are obtained from 
this 10 x 10 matrix. All the parameters for modal amplitude coherence (Eq.45) 
and modal phase collinearity (Eq. 54) are 100%. Although Eqs. (61) and (62) 
are a different realization from the system (59) and (60), they are equivalent 
in the sense that a unitary transformation and normalization will make them 
equal . 

By forming the matrices V in Eq. (5) and W in Eq. (9) with the aids of 
Eqs.(58)-(60) , the reader can see that this realization is controllable and 
observable. 


Case II: A model with repeated eigenvalues and Independent eigenvectors 
Assume now that the system model is represented by 


and 


A l = a 2 


- 0.01 1 . 0 ' 
- 1.0 - 0.01 


<).01xj j 
-j -O.Olxj 


j* 3, 4, 5 


(64) 

(65) 


Using the same process as last case, the ERA realization simply miss the repeated 
eigenvalue K\. The result Is expected since, by control theory for a linear 
system, single input or single output does not make a system with repeated 
eigenvalues and independent eigenvectors controllable or observable. It can 
be verified that the matrices V in Eq. (5) and U in Eq. (9) formed by Eqs. 
(59), (60), (64) and (65) have rank 8. Multi-input and multi-output must be 
used to realize such a system. Let two sensors be chosen and located such 
that 

[ 1 , 0 , 1 , 0 , 1 , 0 , 1 , 0 , 1,01 


C = 


( 66 ) 


[ 0 , 0 , 1 , 0 , 1 , 0 , 1 , 0 , 1,0 


and two initial conditions for free decay responses 



" 0 , 1 , 0 , 1 , 0 , 1 , 0 , 1 , 0 , 1 " 
0 , 0 , 0 , 1 , 0 , 1 , 0 , 1 , 0 , 1 . 


(67) 


Note that the rows in Eqs. (66) and (67) are Independent. For each initial 
condition, a series of "measurement" function y with a sample time Interval 
0.05 second can be generated from the model (55) where each y in this case is a 
vector with two elements for two different sensors. The free decay function 
Y in Eq. (15) is then a 2 x 2 matrix. Using that ji * t^ * 1 and r * s * 45 
for Eq. (16), the ERA realization for a dynamical system is then 
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(68) 


C = 


0.135,-1.686,0.155,-0,172,0.111,-0.032,0.099,0.035,0.195,0.177 

-0.004,0.107,0.142,-0,136,0.111,-0.032,0.099,0.035,0.195,0.177 


g T (t 0 ) 


-0. 014, -0.457, 3, 840, -3. 692, 8. 338, -2. 406, 8. 9 F . ,3. 131, 2. 818, 2.554 
-0.051 , 0.092,3.605,-3.508,8.338,-2.406,8.956,3.181 ,2.818,2.554 


(69) 


where A is identical to that shown in Eqs. (64-65). The singular values Dare 
D = [70.16,44.32,37.97,25.25,1 1.18,9.050.7.950,3.873,0.1 27,0.026] (70) 


The seme error window 6=d n /d-j as last is used. All the parameters for modal 
amplitude coherence and modal phase colinearity are 100%, Again, Eqs. (68-69) 
and Eqs. (66-67) are equivalent in the sense that a unitary transformation and 
normalization will make them equal. The reader can easily verify that this 
realization is controllable and observable. 


Sample Experimental Results 

A sample set of modal Identification results that have been obtained from 
laboratory .est data using ERA are included in this section. The test article, 
shown in Fig. 2, is a 7 ft by 10 ft flexible grid structure that will be used at 
NASA Langley for vibration control experimentation. It is constructed of over- 
lapping aluminum bars of 1/4 In. by 2 in. cross section, riveted together at 
one-foot intervals. Four rivets are used at each joint to provide a tight 
connection. The structure is sc -pended from a stiff overhead beam using two 
short cables attached to the top horizontal member. The results to be shown 
are from a preliminary dynamics test of the grid. It was conducted by exciting 
the structure with an air jet and measuring the free vibration response using 
nine non-contacting proximity sensors. The response sensors were attached to 
a stiff frame located adjacent to the grid for the measurement of out-of-plane 
motions. Eight different excitation frequencies corresponding to resonant 
responses were used. The sampling rate was 32 samples per second. 

The ERA analysis was performed using a single matrix of data from all 
nine response measurements and eight Initial conditions. Each response func- 
tion Y as shown in Eq. (16) was thus a 9 x 8 matrix. The Hankel matrix H rs 
of 72 rows by 400 columns was formed to perform the analysis. Table 1 provides 
h comparison rf the ERA results with analytical prediction from a NASTRAN 
finite-element model. The entries in the center of table are correlation 
coefficients in percent between each ERA-1 dentlfied mode shape and each NASTRAN 
mode shape. High correlation values Indicate good agreement between the two 
shapes. The results show reasonable agreement In both frequencies and mode 
shapes, except for the damping result of the first mode. The main reason for 
the first mode discrepancy is inadequate data length. Only 50 data points were 
used which corresponds to less than one cycle of data for the first mode. The 
results can be improved by using more data points. Note that few high correla- 
tions occur for some modes with significantly different frequencies. This Is 
because only 9 sensors were used In comparison. More detailed experimental 
results for a complex structure are shewn in Ref. [19], 
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CONCLUDING REMARKS 


An Eigensystem Realization Algorithm (ERA) is developed for parameter 
identification and model reduction for dynamical systems. Twi developments 
are given in this paper. First, a new approach is developed to derive the 
basic ERA formulation of minimum realization for dynamical systems. As by- 
products of this approach, two alternative less powerful algorithms, identi- 
fied as A1 and A2, are derived. A special case of A1 Is shown to be equiva- 
lent to an approach currently in use in structural dynamics. Second, accu- 
racy indicators are developed to quantify the partlpatlon of system ■"C^es 
and noise modes in the realized system model. In other words, degree of 
controllability and observability for each participated mode is determined. 
A model reduction can then be made for controller design. 

important features of the ERA algorithm are summarized as follows. 

(1) From the computational standpoint, the algorithm is attractive, since 
only simple numerical operations are needed; (2) the computational procedure 
is numerically stable; (3) the structural dynamics requirements for modal 
parameter identification and the control design requirements for a reduced 
state space model are satisfied; (4) data from more than one test can be used 
simultaneously to efficiently identify the closely spaced eigenvalues; (5) no 
restrictions on number of measurements are imposed. 
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Table 1 : Comparison of the ERA results with the NASTRAN Model 


MODE 2 
CORREI 

5HAPE 



ANALYTICAL FREQUENCY (NASTRAN) 

^ HZ 




DECAY 

Y 


tATION ' 

h— — — 

.362 

.612 

1.425 

2.331 

2.826 4 

.936 

5.476 6. 

429 7 

.047 

9.841 


RATE % 

(ERA1 

.363 

99 

1 

21 

41 

l 

42 

17 

1 

0 

1 


23.1 

71 

100 

.584 

3 

99 

3 

0 

1 

l 

2 

98 

43 

9 


0.33 

100 

59.9 

1 F 
0 R 

1.376 

22 

1 

96 

36 

3 

12 

45 

2 

2 

3 


0.64 

99.8 

99.9 

E E 

n q 

2.047 

38 

0 

8 

94 

7 

1 

1 

1 

0 

7 


0.33 

100 

100 

T U 
I E 

2.542 

2 

6 

4 

3 

96 

2 

9 

3 

15 

96 


0.26 

100 

99.9 

F N 
1 C 

4.852 

48 

11 

22 

35 

3 

70 

65 

10 

1 

3 


0.19 

100 

S9.9 

E Y 
D 

5.095 

9 

9 

29 

19 

2 

46 

81 

8 

6 

2 


0.61 

99.5 

97.5 


7.403 

4 

73 

1 

5 

3 

8 

0 

79 

91 

10 


0.12 

100 

99.4 


9.519 

0 

19 

8 

0 

90 

2 

9 

8 

22 

91 | 

L 

0.24 

97.3 

99.9 


y : Modal Amplitude Coherence 
u : Modal Phase Col linearity 
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ABSTRACT 

This p*P tr advances an approach for state estimation and identification of spatially 
distributed parameters embedded in static distributed (elliptic) system models. 

The method of maximum likelihood is used to find parameter values that maximize 
a likelihood functional for the system model, or equivalently, that minimize the 
negative logarithm of this functional. To find the minimum, a Newton- Raphson search 
is conducted that from an initial estimate generates a convergent sequence of 
parameter estimates. Central to the numerical search are a gradient functional and a 
Hessian operator, which are respectively the first and second function-space 
derivatives of the negative-log likelihood functional with respect to the parameter 
distributions being identified. For simplicity, a Gauss-Markov approach is used to 
approximate the Hessian in terms of products of first derivatives. The gradient and 
approximate Hessian are computed by first arranging the negative -log likelihood 
functional into a form based on the square-root factorization of the predicted 
covariance of the measurement process. The resulting data-processing approach, 
referred to here by the new term of predicted-data- covariance square -root filtering. 
makes the gradient and approximate Hessian calculations very simple. Since the 
parameter estimates are only approximations to the actual parameter values, there is a 
parameter estimation error inherent in the estimation process. Cramer- Rao bounds are 
obtained for the covariance of the estimation error in terms of the information 
operator associated with the likelihood functional. These error covariance bounds are 
then used to outline methods for optimal input design. 

A closely related set of state estimates is also produced by the maximum likelihood 
method: smoothed estimates that are optimal in a conditional mean sense and filtered 
estimates that emerge from the predicted-data- covariance square-root filter. The 
terms "smoothed" and "filtered" are used because the formulas which generate these 
estimates, when expressed in operator notation, are symbolically very similar to those 
used in filtering and smoothing for linear dynamical systems. A key similarity is the 
presence of a predictor- corrector structure containing estimator gains that, as in a 
Kalman filter, can be expressed in terms of the state estimation error covariances. In 
addition, a residual process can be defined, in the usual way, as the difference between 
the actual data and the predicted data obtained from the filtered state estimate. The 
residuals have properties nearly identical to those of an innovations process: the 
residuals are white with a unit covariance; and the residuals and measurements can be 
obtained from each other by means of reciprocal linear transformations. Because these 
transformations are not Volterra (causal), the residuals are not a bona fide innovations 
process. Even though they are not a true innovations process, the residuals are very 
useful, because they lead to state and parameter estimation schemes for elliptic 
systems that retain conceptually the simplicity of those obtained by the innovations 
approach to filtering, smoothing and identification for linear dynamical systems. 
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1 . INTRODUCTION AND SUMMARY 


The elliptic models considered in this paper can be cast as 

A(0)u(0) = B(0)o + C(0)f, (1.1) 

y * H(0)n(0) + n, (1.2) 

where A is a formally self-adjoint elliptic differential operator defined over the spatial 
domain Q; B and C are appropriately dimensioned operators that model the influence of 
the process error u and the input f on the state U{ H is an operator that characterises 
the state-to-observations map; u and n are white-noise model errors forming the model 
error vector c-[o,n]; and f is a deterministic input. Examples of the application of such 
models to the problem of static shape determination of large space structures are 
contained in Ref. [1]. 

The central aim here is to develop a maximum-likelihood approach to the 
estimation of the parameters 0 (these parameters could in general be spatially 
distributed) by using the data y and the system model itself. It is assumed that the true 
value 6 q of the parameter 0 is a deterministic but poorly known quantity. The input f 

can be selected to optimize the data generated for estimation. A related but somewhat 
secondary aim is to develop a methodology for computation of the corresponding state 
estimates. 

A Formula for the Nexative-Loa Likelihood Ratio 

It will be shown in Sec. S that the negative-log likelihood functional is specified by 

J(6;y) * V. Tr log [I+R(0)J + */. [y-m(0)]*lUR(0)l _1 [y-m(0)] - V. y*y, (1.3) 

where 

m(0) * H(0)<t>(0)C(0)f and R(0) - H(0)d>(0)B(0)B*(0)<t>*(0)H*(0). 

The integral operator <W0) is related to A(0) by A(0)dK0)»I, with I the identity. The 
symbol <b* denotes the integral operator adjoint to <t> so that $*<0)A*<0)=I. It will also 
be shown in Sec. 3 that m(0) and R(0) are respectively the "suspected" mean and 
covariance of the data y, under the assumption that the model error vector c«(<o,n] is a 
spatially distributed white-noise process [1] with a covariance operator B(ee*)«I equal 
to the identity. To simplify Bq. (1.3). the following notation has been used: 

y*y. <y,y> # and [y-m(0)]*|I+R(0)J~ 1 [y-m(0)J - <y-m(0)], [UR(0)r 1 Iy-m(0)]> > , 

where <*. •> indicates an inner product in the function space to which the data belongs. 
Predicted- Data- Covariance Square- Root Form of the Likelihood Ratio 

A numb er of alternative formula* for the negative-log likelihood functional are 
developed in Sec. 4. To solve the above minimization problem, the most convenient 
formula is: 


320 




J(6sy) * Tr log[UE(6)] 4 V* z*(0)z(9) - z*(0)y, 


(1.4) 


where 


z(0) = L(0)y + lI-L(0)]m(0), (1.5) 

L(0) = I -[I+R(0)f V \ K(0) - [I+R(0)] V, -I. (1.6) 


Equation (1.5) can be viewed as specifying a filter, characterized by the operator L(0), 
that processes the data y and the suspected mean m(0) to provide a "filtered" state 
estimate z(0). This filter L(0) will hereafter be referred to as the predicted 
data -covariance square-root filter because the key calculation required to specify L(9), 
as in (1.6), is the evaluation of the square-root of the predlcted-data-covariance 
operator [UR(0)]. The equivalence between (1.3) and (1.4) can be established by 
substitution of (1.5) and (1.6) into (1.4). 

Note for later reference that the definitions in (1.6) imply that K(9) and L(0) are 
related by 

[I+K(0)) _l = I-L(0). (1.7) 

Furthermore, (1.7) implies that K(0) * L(0)+K(0)L(0) * L(0)+L(0)K(0). 

Gradient of the Likelihood Functional 


The gradient functional dj/d9, to be defined more completely in Secs. 5 and 6, is 
specified by 

3J/30 = Tr|OL/30)(I+K)] 4 (z-y)*«z/a0), (1.8) 


where 


3z/39 = (3L/36)y 4 (I-L)(3m/30), 


(1.9) 


with y = y-m, and 3L/30, dm/30 being the function-space Frechet derivatives of L and 
m. These equations can be obtained from (1.4) by function- space differentiation with 
respect to 9. 


The gradient functional 3J(0;y)/30 in (1.8) is the Frechet derivative [2) of the functional 
J with respect to the parameter 0. The derivative is a linear transformation (assumed 
to be bounded) that maps an admissible parameter perturbation 60 into the 
corresponding perturbation 6J(0,60;y) of the likelihood functional by means of the 
equation 6J(0,60;y) * (3j(0sy)/30] 60. Detailed computation of the function- space 
derivatives above is conducted in Sec. 6 using a perturbation analysis of the 
eigensystem of the covariance operator R ■ obtained in Sec. 5. 

Note that in Sec. 7 it will be established that 


Eiaj^yi/aoi ^ 

0*0 O * 0, 


( 1 . 10 ) 
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so that the expected value of the gradient vanishes at the optimal parameter value 9 0 . 
Hessian of the Likelihood Functional 


Similarly* differentiation of (1.8) leads to 

a’j/ae’ » Tr io a L/d9 a ) (i+K) ♦ OL/ae) OK/ae) i + (z-v)*o s z/39 a ) + 


(3z/39)*(dz/ 39), (1.11) 

and to its expected value at 9*9 of M(9 ) » B[3 a J/39 a ]U « , i.e., 

o o 


B(a a i/89 a j ie,e * Tr lOL/ae) a+ k) «l*/ 39)] ♦. B[Oa/ae)*o*/a9)]. d . 12 ) 


Furthermore* substitution of (1.9) in the last term of (1.12) leads to 

B[a a J/39 a J | e , e * 2Tr |OL/a0) a+R) OL*/39>] + [d - L) (3m/30)J* 

0 

[d - L) (dm/39)]. (1. IS) 

Note that the expected value of the Hessian operator d a J/86 a evaluated at 9*9. is a 

c 

sum of two terms each of which is positive definite. Consequently* in a probabilistic 

sense made precise by (1.13)* the likelihood functional is strictly convex in the vicinity 

of the optimal value 9*9 . Note that by definition M(9 ) in (1.13) is also the 

o o 

information operator associated with the likelihood functional. 

Newton- Ranhson Search for t >»« Optima l Parameter Estimates 

Since the problem of minimization of J(9$y) in (1.4) has no closed-form solution, it is 
necessary to consider iterative numerical search techniques for optimization. The 
following constitutes a function-space Newton- Raphson iteration: 

e n+1 *9 n -M‘ l g . <1.14) 

n n 

where g Q = 3J(9°5y)/39 is the gradient functional (1.8) evaluated at 9*9 n j and where 

Mq - Tr K3L/39) (I+R> OL*/39)J 4 (3z*/39) ( 3 z/ 39 )|q .0 (1.15) 

Q 

is an approximation to the Hessian operator 3 a J/39 a in (1.11). This approximation is 
obtained from (1.12) by replacing the second term B[(dz*/39)(dz/d9)) with the actual 
value ((3z*/39X3z/39)J obtained in a single realization. Under certain conditions, to be 


322 



examined in more detail in future work, the sequence 6 n specified by (1.14) converges 
to a local minimum of J(6;y), if the initial estimate used to start the search is 
sufficiently close to the optimal value. 


Cramer- Rao Bounds and Optimal Input Design 


The above numerical search results in an estimate 0 of the actual parameter value 0 Q . 


In Sec. 7, a C-R bound for the covariance B(6 6 *) of the estimation error 6 

P P P 


obtained from the inequality. 


0-6 is 
0 


E(0 0 *) 5 M - 1 (6 ), (1.16) 

P P o 

where the information opera or M(0 o ) is specified in (1.13). The related mean-square 
estimation error is bounded by £(0^*0^) 2 TrlM - *(0 Q )]. 

The information operator M(0 q ) can also be used to specify criteria for optimal input 

design. Perhaps the simplest optimal selection method to implement is that which 

seeks to maximize Tr [M(0 )] with respect to f, subject to the constraint that f satisfy 

o 

the normalization condition of f*f=l. This method results in an optimal f which is the 
eigenvector corresponding to the largest eigenvalue of a positive-definite matrix 
described in detail in Sec. 7. Other criteria for optimal selection based on calculation 

of M -1 (6^) may be more difficult to implement but usually lead to superior 

performance. 

The Corresponding State Estimates 

Related to the parameter estimation approach are the following two distinct state 

estimates (denoted by u and z ): 

o o 

u o * E(u/y) * 4>Cf + G(y-H«PCf), z q » <PCf 4 g(y-H<bCD, (1.17) 

where G and g are Kalman-like gains (see Sec. 8) specified by 


G = £ sin a a^Xj,<pj.* F g * £ (1-cosa^) Xj^*. (1.18) 

In these equations, are the eigenvectors of the operator R - so that 

R4>k* X a <|> k , with \ a being the related eigenvalue*. Also, and Xj, are defined by 
tano^ * \.j, and Xj, = \~ a <bBB*<P*H*^. 

The state estimate u Q » E(u/y) is defined as the conditional expectation of the state 
given the data y. Since u q is an optimal estimate of u based on the entire data set (as 
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opposed to s subset), u Q can be viewed as a best smoothed estimate. The other 

estimate, z o in (1.17), will be referred to as • filtered state estimate. Ths filtered 

estimate has no known probabilistic interpretation similar to u q - B(u/y) above. 

However, in spite of the apparent lack of probabilistic meaning, estimate is useful 
in simplifying the gradient and Hessian calculations in (1.8) and (1.11). It will be shown 
in Sec. 8 that z q in (1.17) and z, the estimate emerging from 

the predicted-data- covariance square-root filter, are related by z « Hz . Hence, z is 

o o 

a bona fide estimate of the entire state, whereas z - Hz is a partial estimate defined 

o 

only at the observation locations. 

Kalman-like Gains and Error Covariances 

The gains G and g in (1.17) can alternatively be specified in terms of the 

covariance of the state estimation error inherent in' u and z , i.e., 

o o 

G * PH*, g » pH*, (1.19) 

where 

P * E((u-u o ) (u-u o )*], p p* » E[(u-z o ) (u-z q )*]. (1.20) 

The corresponding mean-square state estimation error is 

E[(u-u )*(u-u )J - Tr [f], B[(u-z )*(u-z )] - Tr [p+p*J. (1.21) 

0 0 0 0 

Furthermore, f and p are related by 

P = p + P - pH*Hp. (1.22) 

Since the term pH*Hp is non-negative, the mean-square estimation error associated 

with the smoothed estimate u is never larger than that of the filtered estimate z . 

o o 

Filtering end Stwnnthiwg 

While u and z have been defined some shat independently in (1.17), they are 
o o 

related by: 

u » 2 4 pH*e, (1.23) 

o o 

where 

e * y - Hz q - « - HpH*)y - (I - L) y (1.24) 

is the residual process defined as the difference between the data y and the 
observed-state estimate Hz Q . The symbol y in (1.24) denotes the mean-centered data 
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process y * y - H<*>Cf. It will be shown in Sec. 8 that (1.22) and (1.23) constitute a 
generalization to elliptic systems of the forward/backward sweep method for solution 
of smoothing problems in linear dynamical systems. 

The Residuals as a Pseudo- fawwtin ng Process 

The residuals in Bq. (1.24) have two properties that are similar (but not identical) 
to those of an innovations process: 

B(ee*) = I, (1.25) 

e = (I - L) y, y * (I ♦ K)e. (1.26) 

Eq. (1.25) reflects whit ness of the residuals. Bq. (1.26) states that the residual and 
mean-centered data processes e and y can be obtained from each other by means of 

reciprocal transformations, i.e., (I + E)~ x * (I - L) as in (1.7). Whiteness of the 
innovations and reciprocal relationships between innovations and measurements are the 
two central features of the innovations approach to least-squares estimation for linear 
dynamical systems. Bqs. (1.25) and (1.26) are similar to these conditions. However, 
there is a key difference: the transformations (I + K) and (I - L) in (1.26) are Fredholm 
operators whose domain is the entire measurement space. This is in contrast to the 
Volterra (causal) operators in the innovations approach for linear dynamical systems. 
The notion of causality is not even used in this paper, although such a notion can be 
defined for certain classes of elliptic systems [1]. Because of this difference the 
residual process is not a bona fide innovations process. However, the residual process is 
stili useful in obtaining the relatively simple formulas in (1.8) - (1.26) for filtering, 
smoothing and identification. 

Paper Outline 

This section has at a summary level addressed many of the fundamental issues involved 
in the maximum likek_.>od approach to estimation. The subsequent sections of the 
paper contain a more complete description of the above results. 

Section 2. Development of the mathematical framework -- including integral 
operator models, a priori covariance analysis with white-noise model errors, Predholm 
resolvents, and eigenfunction expansions — required to arrive at formula (1.3) for the 
likelihood functional and to evaluate the corresponding function-space gradient in (1.8) 
and the approximate Hessian in (1.15). 

Section 3 . Derivation of the negative-log likelihood functional in (1.3). This 
functional is the negative logarithm of the likelihood ratio, associated with the 
detection of a Gaussian signal in additive Gaussian noise, traditionally encountered in 
the theory for communication and signal detection. 

Section 4. Development of alternative formulas for the likelihood ratio, some of 
which are more convenient to use than (1.3) in implementing the numerical search for 
optimization — in particular, development of the predicted-data-covariance 
square- root filter form (1.4) upon which the Newton- Raphscn search is based. 
Additional forms of the likelihood ratio which are of interest in their own right 
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(although not subsequently used in the paper) are: a smoothing form expressed in terms 
of the best mean-square state estimate; an eigenaystem expansion form based on the 
eigenvalues and eigenvectors of the operator in (1.3); a trigonometric 

operator form with which most of the manipulations involved in the may; mum likelihood 
approach can be visualized using their similarities to simple trigonometric formulas fo* 
scalars. 

Section 5 . Development of a first-order perturbation analysis to evaluate the 
infinitesimal changes in the eigensystem of the operator R«H$BB*<h*H* in (1.3) due to 
similarly small changes 66 in the parameter distributions being identified. This is the 
central calculation required to compute the function-space gradients 3J/80, 3z/30, 
3L/39 and 3m/30 in (1.8) and (1.9). 

Section 6 . Calculation of the gradient functional and approximate Hessian of the 
likelihood functional based on the perturbation analysis of Sec. 5. These are the two 
calculations which are central to implementation' of the Newton- Kaphson search and 
which have been used as a basis for computer programs to Implement the maximum 
likelihood approach. 

Section 7 . Parameter estimation error covariance analysis and Cramer- Rao bounds 
based on explicit formulas for the Hessian (information) operator in (1.13). Outline of 
an optimal input design approach based on using the Cramer- Rao bound as an optimality 
criterion. 

Section 8 . Analysis of the filtered and smoothed state estimates embedded in the 
parameter estimation approach. Analysis of the predicted~dat« -covariance square-root 
filter resulting in Kalman-like formula? for the filter gain, evaluation of the state 
estimation error covariance, and relationships between filtered rod smoothed estimates. 

Section 9 . Summary and explanation of calculations required for implementation of 
the numerical search for the optimal estimates. 

Section 10 . Conclusions and directions for future work in the areas of development of 
as; iptotic properties of the estimates and of optimal input design. 

2. PRELIMINARIES: Notation, Integral Operator Model, Covariance Analysis, 
Predholm Resolvents, and Bigenf unction Expansions 

The aim of this section is to develop a set cf miscellaneous results that will be useful in 
subsequent sections in conducting detailed derivation cf: the negative-log likelihood 
functional in (1.3) to be minimized, the corresponding function-space gradient in (1.8), 
and the approximate Hessian operator in (1.15). The main results of the section can be 
summarized as follows: 

• conversion of the partial differential operator model in (1.1) to an 
equivalent integral operator formulation. This integral operator 
formulation is introduced because it simplifies the statement and solution 
of the estimation problems in (1.1) - (1.3). 
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evaluation of the observed state covariance operator in 

(1.3), under the assumption that c = [o. ,n] is a spatially distributed 
white-no se process with a unit covariance operator. Related to evaluation 
of this covariance operator R is the similar evaluati^ of the suspected 
mean m = ri<t>Cf in (1.3). 

evaluation of the dual observed-state covariance operator Q=3*<b*H*H<&B 
- which can be viewed as xhe covariance of the output of a system model 
dual to (1.1), unde the assumption that this dual system is driven by a 
white-noise process. 


• definition of two sets of eigenvectors and of R and Q above, with X* 

being a set of common eigenvalues, xhese two sets of vectors can be used 
to expand functions in the input space and the output space Hg. 

* 

• definition of u vectors x^_ and p^ in the state space and its dual IT ? , 

related to and <|>j, above by ^ * X” 1 d>B^ and Pj, « Xj, l $*H*0j,. These 

two sets of vectors and satisfy a boundary-value problem similar to 

those traditionally encountered as necessary and sufficient conditions for 
optimality in quad atic optimal control rnd estimation problems subject to 
linear constrain 

• analysis cf the basic relationship between R and Q above and their 

corresponding Fredholm resolvents P and S defined as P - I-(UR) 1 and 

S - I -d+Q) -1 . Exp if sion of the operators R, Q, P and S in terrrs of the 
eigenfunctions <t>^ and ^ defined above. 

• development of trigonometric operator forms for R and ?. These 
trigonometric forms allow development of ir* ’.resting trigonometric 
alternatives to (1.3) in evaluating the likelihood lu .ticnal. 

While the section concentrates on the development of a mathematical framework to be 
used in subsequent sections, many of the above results (such as tht trigonometric 
operator formulas for the covariance operators) are of interest in their own right, 
somewhat independently of their subsequent application. 

Hilbert Space Notation 

There are three Hilbert spaces of primary interest: the input space H^ to which 
the process error a and the deterministic input f belong; the state space containing 
the state u; and the measurement space H* where the data y and the observation error 
n belong. The inner product between two arbitrary elements u and v in the apace H^ Is 
denoted by <u,v> i or fry the simpler notation u*v * <u,v> i - Similarly, uv* denotes a 
Hilbert space outer product. 
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Conversion to Integral Operator Model 


It is convenient for subsequent developments to convert (1.1) to sn eqoivelent 
integral operator formulation, To this end, define the Green'* function $(x/£) of A as 
the solution of 

A x <Kx/C)*6<x-C), (2.1) 

where 6 is the impulsive delta function, and where the subscript x in A^ denotes that 

the spatial differentiations embedded in A are performed with respect to x (as opposed 
to being performed with respect to £)• Define then the integral operator <t> whose 
kernel is the Green' $ function, i.e., 

4>v = <J>(x/£)v<£)d£, (2.2) 

for all admissible functions v. Note that <t> is the integral operator such that A<t> = I, 
where I is the appropriately dimensioned identity. 

With these definitions at hand, it is possible to recast (1.1) and (1.2) as 

y = m(0) + H(0)4>(0)B(0)u + n, (2.3) 

where m(6) is the “suspected mean* 

m(0) = H(0)<t>(0)C(0)f. (2.4) 

Equation (2.3) can be cast into the following even more compact notation 

y = m(0) + h(0)c, (2.5) 

where c = [t»,n] is the model error vector [1], and h(0) is the operator h(0) * 
U«0XJ>(0)B(0) 1 1 ). 

Predicted- Data Mean and Covariance 


The evaluation of the predicted mean and covariance of y, needed as a preliminary 
step to arrive at (1.3), is based on the key assumption that the model error vector c * 
[o,n] is a zero- mean spatially distributed white-noise process whose covariance 
operator E(ee*) is the identity, i.e., 

E(££*) = I, (2.6) 

where I is the appropriately dimensioned identity. Note that thir assumption is not at 
all restrictive, because the more general case where the model errors c » [o>,nJ are 
correlated (with a nonidentity covariance operator) can be handled within the same 
formulation hy selection of the operator B in (1.1). It is assumed here that BB* is 
bounded and trace-class, with kernel b(x/{) satisfying Jq b(x/x)dx <°°. 
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Remark 2.1 


The process y is s random field with mean and covariance specified as 


E(y) = m(0), E{(y-m(6)] [(y-m(e)J*} * I+R(0), (2.7) 

where R(0) = H(0)d>(0)£(0)B*(0)O*(0)H*(0). That E(y) * m follows from (2.5) and the 
fact that c i* zero-mean. The second of Eqs. (2.7) follows from the following sequence 
of operations: B((y-m)(y-tn)*] = E(bee*h*) = hE(ee)h* * hh* = I+R. A more detailed 
development of the above result is contained in Ref. [1]. 

Remark 2.2 The process u in (1.1), representing the state of the system, is a random 
field with mean and covariance specified by 

E(u) = <DCf, E((u-<J>Cf)(u-<t>Cf)*J » R(0), (2.8) 

where 

R(0) = <t>(0)B(6)B*(0)<t>*(0). (2.9) 

Note that the state covariance R and the "observed-state” covariance R in (2.7) and 
(2.8) are related by 

R(0) = HR(0)H*. (2.10) 

Pemark 2 .°. The state covariance R satisfies the partial differential equation 

ARA* = BB*, (2.11) 

a result which can be established by pre- multiplication of R in (2.9) by A and 
subsequent post-multiplication by A*. 

Remark 2.4 The state covariance operator $ can be represented as the following 
integral operator 

Rv = r(x/f) v(f)df , (2.12) 

where the kernel r(x/£) satisfies 

A x r(x/f)A f » b(x/f), (2. IS) 

and where b(x/£) is the kernel of BB*. This result can be established by means of the 
following sequence of operations. Consider an admissible function v (admissible in the 
sense that it can be operated on by the operators AfiA* and BB* in (2.11) so that 
ARA*v = BB*v makes sense). In terms of the corresponding kernels r and b, this last 
equation becomes 

A x { J Q r(x/C> [A f *v(f)Jdf} - J Q [A x r(x/c>A f ] v(f)df . J ft b(z/C>v(f>dC, (2.14) 

where the first equality is valid because by definition A^* is the formal adjoint of A^. 
Since Eq. (2.14) must be valid for all admissible v, then (2.14) implies (2.13). 
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Remark 2.5 The state covariance kernel r in (2.12) can be expressed as 


tix/O = I Q J Q «b(x/n) wve) <*8/E> dude, (2.15) 

where <t> is the Green's function of A, and b is the kernel of BB*. This result can be 
established by expressing (2.9) in terms of the operator kernels ♦ and b of $ and BB* 
and by subsequent reversal of the order of integration. 

PwmjrV i t In the special case, of interest in many applications, where the process 
error <■> is discretely located at the M locations and the sensors are placed 

at the N locations [f^,... then R * H<t>BB*<b*H* is a matrix whose general element 

R^. is specified by 

M 

R..= l ♦(x i /n l )<Xu l /f J ), (2.16) 

1*1 


where the summation is taken over the disturbance locations. 

The Dual-Model Covariance Operators 

Closely related to R and R above are the "dual" operators defined as 

Q(O) * B*(exb*(e)H*(e)H(exwe)B(0), q (©> . weiiwemoywe). (2.i7> 

Note that Q and Q can be obtained, from R and R respectively, by making the 
substitutions <h-*b* and B-»H*. This observation can be used as the basis for defining the 
dual system model, illustrated in Fig. 2.1, whose state and output have covariance 
operators specified by Q and Q above. 
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Figure 2.1. Illustration of Primal and Dual Models 
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The primal system model is based on (2.3), with a, u and Hn denoting the process error, 
the system state, and the observed state respectively. For this model, 5 » B(uu*) is the 
covariance of the state, while R » HRH* is the corresponding covariance of the 
observed state. It is assumed for the sake of this discussion that the deterministic input 
f in (1.1) has been set to zero, so that the suspected mean m in (2.3) is zero also. With 
this assumption, it is not necessary to show m in the block diagram in Pig. 2.1, and the 
relationship between the primal and dual models is illustrated more easily. The dual 
system model is characterized by the dual operators 9*, d>* and B*, by the dual or 
adjoint state X, and by the observed dual state B*X. It is assumed that the dual model 
is driven by a unit- covariance white-noise process so that Eton*) * I. This input process 
n driving the dual model can be thought of as being the observation error process in 
(2.3). Por this dual model, § * B(XX*) and Q « B(B*XX*B) are respectively the 
covariances of the state X and the observed state B*X. Upon multiplication of X in 
Fig. 2.1 by A*, the following partial differential equations result to describe the dual 
model 


A*X = H*n. (2.18) 

Note that the dual state covariance Q * and the dual observed-state 

covariance Q = B*0*H*Hd>B are related by 

Q . B*QB. (2.19) 

In the same spirit used to arrive at (2.11M2.16), it is now possible to develop the 
following properties of the operator 0 and its corresponding kernel q. 

Remark 2-7 The dual-state covariance operator (J satisfies 

A*QA » H*H, (2.20) 

a result which can be obtained from Q * <b*H*H<b in (2.17) upon multiplication 
by A*( *)A. 

Remark 2.8 In terms of its kernel q- the operator Q can be expressed as 

Qv - X Q q(x/£)v(C)dC, (2.21) 

where q satisfies the differential equation 

A^q(f/x)A x = h(£/x), (2.22) 

and where h(£/x) is the kernel of H*H. This result can be established by an approach 
quite similar to that used in arriving at (2.13). The symbbl v denotes again an 
admissible function defined to be admissible if (2.21) makes sense. 

2.9 The kernel q(?/x) of 0 can be expressed as 

q(£/x) - X Q f Q h(Tvtf) <*X/B) du dfi, (2.23) 
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where h is the kernel of H*H. This resole can be established in s meaner analogous to 
that nsed in arriving at (2.15>. 


Remark 2.10 In the special c- <e most typical in applications, where the range of H 
and B is finite-dimensional, the. Q » B*<b*H*H<bB is a matrix whose general element 
Q„ can be expressed as 

N 

Q i)“ £ q(x } /T V’ <2.24) 

k-l 

where the summation is taken over the set of sensor locations. 

Spectral Repre s entations 

Recall that R * H$BB*6*H* is the observed-state covariance operator in (2.7). 
The eigenvalues of R are defined as the nontrivial solutions of 


R<P k « X’dy (2.25) 

with 4> k being the corre sp onding eigenvectors. Note that, far cases where H has a 

finite-dimensional range, the operator R is an N-by-N matrix with a finite number of 
eigenvectors. In the more general case where the range of H is infinite -dimensional, 

then R is usually compact and X^-*0 as b*». In both of these cases, the following 

Mercer expansions hold for R and its kernel r 

R » Z ♦♦ and t(x/C> - E Xjg^xttJlf) (2.26) 


Furthermore, the normalized eigenvectors form an orthonormal basis for the 
observation space H^. This implies that E $^4^* * 1, where I is the identity in H^. 


Closely related to the basis $ k above are the dual vectors defined as 


♦ k * X~ l B*d»*HN»j., (2.27) 

which can be viewed as the result of applying an in put to the dual system model 
(2.18) and then "balancing* the output by dividing by the eigenvalue X^. 

Remark 2.11 The vectors P k defined by (2.27) are the eigenvectors of the dual 
observed-state covariance Q » B*<b*H*H$B, i.e., 

Q* k • X k P k . (2.28) 
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This result can be established by premultiplication of (2.27) by H<t>B and use of the 

condition R4> k - X*<|> k . Note that, if the dimension of the input space Hj is greater 

than that of the output space H^, then the do not span the input space. They do, 

however, span the range subspace of the operator B*4*H*. Consequently, they r«mnt 
be used to expand vectors in the null space of H$B. 

Remart 2.12 The vectors * k are also related to $ k by the equation 

4> k * X~ l H<t>B* k , (2.29) 

a result that can be obtained from (2.27) upon p re multiplication by the operator Hd>B 
and use of the condition R$ k * 

Remart 2 1 $ The dual-state covariance operator Q and its corresponding kernel q 
can be expressed as 

q = i 7 -^kV q a E (2 m 

a set of equations which are analogous to (2.26). This result can be obtained from the 
observation that Q = B*<b*H*H<t*B » B*<b*H* (E HOB « E Use has 

been made of the condition E 4> k <t> k * » L 

The vectors 4> k span the observation space H^. While the vectors ^ k do not 
span the input space Hj, they do span the range of B*$*H*. So far, no attempt has 
been made to obtain vectors that can be used to expand functions in the state space 
or in its dual space H^*. To this end, define 

Xj^-X” 1 $Bt k , F k - X~ l d>*H*$ k . (2.31) 

The vector x k is in the state space whereas the adjoint variables p k are in th* dual 
space. In general, neither one of these two vectors however spans the state space x'j. 

Remart 2.14 The vectors x k and p k are orthonormal with respect to H*H and BB* 
respectively, i.e., 


v H *«v 

* 0, P k *BB*p m ■ 0, 

k v m, 

(2.32) 

V H,Hl k 

» 1, p k *BB*p k . 1. 


(2.33) 
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These results can be established by the following sequence of operations: Xj_*H*Hx^ ■ 

(Hx^)*H* m * 4nii • Since ^ and are 

orthonormal, then (2.32) and (2.33) follow. 

Remark 2. IS The vectors x^, ^ and p^, ij, are related by 

4> k = Hx^ and * B*p k - (2.34) 

This result follows readily from the definitions in (2.27), (2.29) and (2.31). 

Remark 2.16 The vectors x^ and p^ satisfy the boundary-value problem: 


A 

” 

0 


X 


0 

BB* 


X 




k 

1 




k 

0 

A* 


P 

X 

H*H 

0 


p 

NX 

1 

J 


Lk J 

k 




L k J 


(2.33) 


This result can be established by operating on x^ in (2.31) by A and on p^ by A* to obtain 


Aij, » X^, 1 By k and A*p^ « > H*#^. 


(2.36) 


Then, substitution of (2.34) in (2.36) implies (2.35) 

Note the similarity between this problem •. 4 those traditionally encountered as 
necessary (and at times sufficient) conditions for optimality in quadratic optimal 
control and estimation problems for linear systems. 

The Fredh olm ^es ohrgnts of the Covariance Operators O tnd R 

The Fredholm resolvent of R is defined as that integral operator such that 
(1+ R) -1 = I-P, a relationship which immediately implies that 

R = P 4 RP and R * P 4 PR (2.37) 


In terms of the corresponding kernels r and p, these equations become 

r(x/C) * p(x/() 4 Xq r(x/T|)p(V?)dT) (2.38) 

for the case with continuously distributed data. In cases with discrete data, R and P 
are matrices whose general elements R. and P. are related by 


N 

*k,m 3 *k,m + ^ ^k,n^n,m" 

n*l 


(2.39) 
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In both of these equations (2.38) end (2.39), the unknown is the Fredholm resolvent P, 
whereas the observed-state covariance kernel R is known. 


qamertr 2.17 The integral operator R and its Fredholm resolvent P commute. This 
result, which can be stated as 

RP = PR, (2.40) 

is a direct consequence of (2.37). 

Remark 2.18 Equations (2.37) also imply that 

P =(l4R) _l R * R«+R)“ l , R = (I-P)~ l P « P(I-P) -1 - (2.41) 

Remark 2.19 In a manner analogous to (2.37) - (2.41), it is possible to define the 

resolvent S of the dual-state covariance operator Q by the relationship (UQ)~ l * I-S 
which implies 

Q = S ♦ QS, Q = S + SQ, SQ * QS, (2.42) 

and 

S = (!+Q) —1 Q - Q(I + Qf\ Q » (l-Sf X S « S(I-S)“\ (2.43) 

Remark 2.20 The Fredholm resolvents P and S can be expressed as 

P = E (\’/(U\’)] 4> k 4> k *. S» Z IXj/(UX k )Nr k t k *. (2.44) 

p(x/?) = E [X k /(1+Xj)l 4> k (x)4> k (f), s(x/*)« E [X^(l+X^Hr^x)t k (C). (2.45) 

These expansions can be established by substituting (2.26) and (2.30) into (2.37) and 
(2.41). 

Trigonometric Operator Forms 

Remark 2.21 The predicted-data- covariance operator (UR) can be expressed as 

I 4 R s I 4 TAN a a - SEC a a, (2.46) 

where TAN a a and SEC a a are the operators 


TAN a a » £ tan 2 a k 4> k <p k * . R, 


SBC 2 a ■ E se^a^k^k*’ 


(2.4?) 
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and tan is defined by tan « X^,. Note also for later xeference that 

(I+R) ^ » SECa * £ seca^ 4 > k 4 > k * , R ^ ■ TANa ■ £ tana^<J>^ 4 >^* . (2.48) 

Proof; Recall that tan ci g £ thereby establishing (2.47). Use 

of the formal expression I = E <|> k <l> k *for the identity I implies that (I+R) « E (l+tan a a^) 
^k^k*’ wllich le * d# to ^2.46). Equations (2.48) are obtained from (2.46) and (2.47) by 
performance of the square-root operation. 

Remark 2.22 Ihe Fredholm resolvent defined as P ■ I - (I+R) -1 of the covariance 
operator R can be expressed as 

P * SIN a a, (2.49) 

where SIN a a is the operator defined by the expansion 

SIN 2 a = £ E [X a /(l+X a )M> k 4> k *. (2.50) 

Proof; This result can be established by substitution of tan 2 ^ « in (2.44). 

R*m*rk 2.28 Equations (2.47) and (2.48) together imply that 

P = Rd+R) -1 - TAN a a[I+TAN a a]“ l - TAN a a {SEC a a] — 1 - S!N a a, (2.51) 

a trigonometric operator identity that ran be viewed as a generalization of a imilar 
identity involving scalars. 

3. DERIVATION OF THE LIKELIHOOD FUNCTIONAL 

Based on the results of the previous section, it is now possible to derive the 
likelihood functional in (1.3) to be minimized. Since the development required to 
achieve this is fairly lengthy, it is convenient to summarize in adivance the pivotal steps 
involved in the derivation: 

• the integral operator model y ■ m ♦ H6Bo + n in (2.3) is first coverted into an 

equivalent "spectral* form y k « + \ k » k + Uj,* where y k « 4> k *y, « k » 

n k ■ 4> k * n are the corresponding spectral coefficients. 

• the spectral coefficients y k of the data y are a sequence of independent 
Gaussian random variables with mean B(y k ) > m^, covariance - 1 + X a and 
probability density P k (y k ;9) ■ n~ exp [-(y^m^/Ra^] 
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• * “finite-dimensional* likelihood ratio is then defined as the product of a 
finite number N of terms involving the probability densities p^(y^;0) above. 

• an "infinite -dimensional" likelihood ratio is obtained by letting the number N 
of spectral coefficients approach infinity. The related negative-log 
likelihood functional in (1.3) is obtained by taking the negative logarithm of 
the functional that results from the limiting process. Of course, in cases 
where the data is finite-dimensional (obtained by means of a finite number of 
discretely located measurements), the limiting process Involved in this last 
step is not necessary. In this case, the "finite - dimensional" likelihood 
function obtained in the previous step is the function to be minimized to 
obtain the parameter estimates. 

The remaindc of this section contains a more detailed derivation of the foregoing 
results. 

Recall that 

y = m + H$Bo 4 n, (3.1) 

where m * H4>Cf and f is the input. As outlined above, the first sv~p toward evaluating 
the likelihood function is to convert (3.1) into an equivalent “spectral" form by using he 
eigenvectors <f> k and i.e., 

y - E y k <t> k . «■> * E « k * k . n - £ n^, m E nyjy (3.2) 

Substitution of (3.2) and (3.1) and premultiplication of (3.1) by <J> k * leads to 


Vk*V 


(3.3) 


Result 3.1 ^ and n^ are independent Gaussian random variables with mean and 
covariance given by 

MEAN COVARIANCE 

E(o k ) = E^) * 0 B(y*) » 1 + 


E(y k ) * n^ E(« k ) “ * 1 

E(7. 7 ) * 0 m*k 
t m 

where y. » y k - m^. Hence, y k is a sequence of independent Gaussian random variables 
with mean m^ and covariance 1 4 X k> 
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M 

Let y * [yj.—.yjj] be an N-dimensiorul vector consisting of the first N spectral 
coefficients y.^ of the data y. Because y^, are independent Gs. a random variables 

with mean and covariance o£ » 1 ♦ \*. their corresponding probability densities 

P k (y k ;0) » n _/l c7~ l exp (-y*/2a*) can be multiplied to obtain the probability density 

ltf N 

p(y ;9) of the composite N- dimensional vector y , i.e., 


N 

N 



p(y N }0) = n 

p k (y k J0> * n 

o^if^ezp 

(3.4) 

k.l 

k-1 




In order to obtain a likelihood functional for the identification problem with the 
function-space process y as the data, it would be desirable to set N-*°° and obtain what 
would be in the limit a probability density functional (PDF) for the vocess y. 
Unfortunately, this limit may not exist because the right side of (3.4) may not converge 
as and consequently a PDP for the process y cannot be defined in this manner. 

However, this can be circumvented by dividing by 
N 

p o (y N s 0) . R tr“ Vl exp (-yJ/2). (3.5) 

k-1 

This results in 

N exp [-(y -m )*/2(l+\ )] 
k k k 

A(y N ;0) « n , (3.6) 

V* 

k»l a a 

(1*\ ) exp (-y /2) 

k k 

N 

which can be viewed as a likelihood ratio consisting of the PDF of the process y with 

the "signals" and m^ nonzero, divided by the similar PDP of y^ with the signals 

and rrij, set to zero. The term likelihood ratio used to describe (3.6) is consistent with 

terminology common in the theory for detection of Gaussian signal! in additive 
Gaussian noise [3). 

Although the limits of p(y^}9) and P Q (y**t6> appearing respectively in (3.4) and (3.5) may 

not exist when taken independently, the limit of their ratio in (3.6) is a well-defined 
quantity specified by 
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exp[(-Vj) (y-m)*(I+R)~ ^(y-m)] 


(3.7) 


\<y,Q) = Urn A(y**;0) 

[det(J+R)J V * exp[(-V*)^y] 

This is the desired expression for the likelihood ratio that the maximum-likelihood 
method seeks to maximize . It can be interpreted as the likelihood ratio for the 
detection of the "signal" m ♦ K<t>Bu in (3.1), in the presence of the noisy Gaussian signal 
n. Instead of maximizing A(y;6) directly, it is more convenient to minitnig* the 
negative-log likelihood functional defined as J(6;y) * -log [A(y>G)], or, more explicitly, 

J(©jy) = */a log det [I+R(0)1 + */« (y-m(0)]*[I+ R(0)] -1 [y-m(0)l - '/* y*y. (3 8) 

Note that for the special case with no deterministic input. ,*m=0 in (3.1), and the 
negative-log likelihood in (3.8) educes to 

J(0;y) = V. log det [I+R(0)J - V. y*P(0)y, (3.9) 


where P(0) * I - (I+R(0)] 1 is the previously defined (in Sec. 2) Fredholm resolvent uf 
the predicted-data-covariance operator R. 

The first term in both of these last two equations can be cast into an equivalent and 
somewhat more convenient form in use of the identity [4] 

log det [I-f R(0)] = Tr log [I + R(0)]. (3.10) 

Substitution of (3.10) in (3.9) leads to 


J(0;y) * V* Tr log [1+ R(0)] + V* [y-m(0)]*ll+ R(0)f x [y-m(0)} - V* y*y, (3. 1 1) 

which has been recorded previously as (1.3) and constitutes the central aim of this 
section. 

Reorientation 


The method of maximum likelihood, as defined here, results in estimates that 
minimize J(0;y) in (3.11). This minimization problem can be viewed as a function-space 
nonlinear programming problem subject to the system model constraints that 
R(0) = K(0)<t>(6)*(0)B*(0)<t>*(0)H*(0) and m(0) » H(0)4>(0)C(0)f. Since no closed-form 
solution to this problem exists, it is necessary to use .numerical methods for 
optimization. However, there exist alternative formulas for the likelihood ratio that 
are more convenient to use in the implementation of the numerical methods. Such 
formulas are developed in the following section. 
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4. ALTERNATIVE FORMULAS FOR THB LIKELIHOOD FUNCTIONAL 


V, Tr log IUR(0)] 4 ‘/, ly-m(0)]*H+R(e)} ‘ly-m(0)] - V. y*y 

BASIC 

V. Tr log [UR(0)J ♦ V, Iy-m(0)j*[y-Hu o (0)J - */,y*y 

SMOOTHING 

*A L llog(UX a ) 4 (l4X*)~ 1 (y i _-m^) a -y£] 

SPECTRAL 

Tr log [UK(0)J 4 */* z*(0)z(e) -z*(0)y 

SQUARB-ROOT 

PILTBR 

Tr log [SBCa(0)] 4 V. z*(0)z(9) -z*(0)y 

TRIGONOMETRIC 

OPERATOR 


In the above table, the basic formula is expressed in terms of the suspected mean m and 
covariance UR = I + of the data y. The smoothing form is specified in 

terms of the optimal smoothed estimate u Q * E(u/y), representing the conditional mean 

of the state u given the data y. The spectral formula is obtained by substitution in (1.3) 

of the eigensys^em expansions R * EX a y • Ey^$^, and m * where X® 

and <t>^ are the eigenvalues and eigenvectors of the observed-state covariance operator 

R. The squ are-root filter formula, previously recorded in (1.4), is based on the 

y. «/* 

factorization of the predicted-data- covariance operator as (UR) » (UR) (UR) and 

-i % 

on the definitions z * Ly 4 (I-Lhn and (UK) * (I-L) ■ (I+R) . Finally, the 

trigonometric operator formula is obtained from the square-root filter expression by 

use of the identities UR • SBC*a and L • I- CO Sc developed in Sec. 2. 

Although the derivation of the above expressions leads to significant insight about the 
structure of the likelihood functional, it is not within the scope of the paper to 
inver'igate all of these alternatives to the same level of detail. The formula involving 
tut predicted-data- covariance square-root filter appears to be the most convenient to 
implement the numerical search for the optimal estimates. This section, however, aims 
to first develop the results summarized above. 

Formulas Based on the Opti cal Smo othed Stat * gjtiBMr s& 

Result 4.1 The negative-log likelihood functional can be t pressed as 
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(4.1) 


1(6*) « V* Tr log (UR(0)J ♦ V. [y-m(0)]*[y-Hu o (0)J - */* v'y, 

u (0) = G(0)y + [I-G(0)H]<b(0)Cf, (4.2) 

c 

G(0) = R(0)H* H+HR(0)H*r l « (4.3) 


where u q = £(u/y) is the conditional expectation of the state u given the data y, and G is 
the estimator gain. 

Proof: It will be shown in Sec. 8 that n in (4.2) is the conditional mean and that G in 

o 

(4.3) is the corresponding estimator gain. Therefore, for the sake of the discussion 
here, assume that (4.2) and (4.3) are valid. Multiply in (4.2) by H and use (4.3) in the 

resulting equation to obtain 

Hu o = HGy + (I-HGhn, (4.4) 


and 


y - Hu = (I-HG) (y-m), (4.5) 

where m * HOCf is as before the suspected mean of the data y. However, recall the 

identity HG = HRH*(UHRH*) -1 - I - (UHRH*) -1 so that I-HG - (I+H5H*) -1 - (UR) -1 . 
Hence, substitution of this last identity in (4.5) leads to 

y - Hu * (UR) -1 (y-m). (4.6) 

o 

This is the central result required to establish the equivalence between (4.1) and \3.11). 
To this end, substitute (4.6) into the second term on the right side of (4.1), and observe 
the equivalence with (3.11) by inspection. 

Result 4.2 The negative-log likelihood functional can be expressed as 

J(0;y) » V. Tr log IUR(0)] + V* IB(u)J* H*H (B(u/y)] - V, lB(u) + B(u/y)]*H*y (4.7) 

where E(u) » OCf and B(u/y) are respectively the unconditional and conditional expected 
values of the state u. 

Proof: This result can be established as a corollary to the Result 4.1 by co mb ini n g 

the last two terms on the tight side of (4.1) and use of the equation m * Hd>Cf. 

Both of these results express the likelihood functional in terms of a quantity u q in (4.2) 

which is the conditions, expectation B(u/y) of the state given the data y. This quantity 
is also known to be the best linear mean-square estimate as well as the optima) 
least- squares estimate. The coincidence of the best mean-square estimate and the 
optimal least-squares estimate, both of which can be computed by the conditional 
expectation formula (4.2), is explored at length in Ref. (1). 
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Remit 4.3 The negative-log likelihood functional can be expressed as 


J(0;y) = V* £ [logfWX*) 4 (I4X*) 1 (y^-m^) a -y^], (4.8) 

where y^ = <J>^*y and * <t>^*m are the spectral coefficients of the data and the 

suspected mean m, and X* are the eigenvalues of R. By substitution of X^ - tana^ in 
(4.8), this equation can be cast as 

J(0;y) = V* £ [log (sec*^) 4 cos’a^ (y^ - m^-y^J. (4.9) 


Proof: Equation (4.8) can be established by taking the negative log of A(y ;0) in 

(3.6) and letting N-*». Use of the identity X^ * tano^ in (4.8) leads to (4.9). 

Result 4.4 The negative-log likelihood ratio can be expressed as 

J(0;y) * £ [V* log (WXj) 4 V. zj - z^jJ, (4.10) 


where 


Zj, * Lj, y^ 4 (l-L^lm^ and » 1-cosa^. (4.11) 

Proof: Define the "residual* process 

e^ » y^ - Zj,, (4.12) 

as the difference between the data y^ and the "filtered* estimate z^. Observe that e^ « 
cosa^/y^-m^) by substitution of (4.11) i=. (4.12). Substitute this last equation into the 
second term on the right side of (4.9) to obtain (4.10). 

The formula for the likelihood functional in (4.10) can be viewed as the "spectral" 
version of the predicted-data- covariance square-root formula described below. 

Predicted- Data- Covariance Square- Root Formula for y^jfej oi Functional 

Result 4.5 The negative-log likelihood functional can be expressed as 

I(0iy) - Tr log [I4K(0)J 4 V. z *(0)z(0) - z*(0)y, (4.13) 

where 

z(0) - L(0)y 4 [I-L(0)]m(0), (4.14) 

with L(0) and K(0) defined as 
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L(0) = I - [I+R(6)]' V *, K(0) « fUR(0)] V * - L (4.15) 

Proof : Conversion of the first term (1/2) Tr log [UR(0)j in (3.11) into Tr log H*K(0)] 

follows because (1/2) Tr log(UR) * (1/2) Tr log (I*K) a « Tr log rt+K). Conversion of the 
last two terms on the right side of (3.11) into the desired form in (4.13) follows from the 
identity 


(y-m)*(UR) 1 (y-m) « [(I-L) (y-m))*|d-L) (y-m)] » (y-x)*(y-z), (4.16) 


where use has been made of the fact that (UR) 1 *• (I-L*) (I-L). 

Result 4.6 The operators L and K can be represented in terms of the following 
eigensystem expansions: 

L = Z (l-cosoj,)^^* K * £ (seco^ - 1) 4> k <p k *, (4.17) 

where ol^, = tan l \ k , and 4> k are the eigenvectors of R. 

Proof: Let 


L = Z L k 4yt> k * with 1^ - 4> k *L4> k , (4.18) 

and then evaluate the as yet undetermined coefficients Lj, from L * I - (I+R) - ^* in 
(4.15). To this end, premultiply L in (4.15) by 4> k * and postmultiply by 4> k to obtain 
a -*/i 

L^ -- 1 - (1+Vj.) * 1-cosa^, which is the desired result. 

Similarly, to obtain the desired expansion for K, seek to determine the coefficients Kj, 
in 

K = £ K k 4> k 4> k * with ^ - <b k *K 4> k . (4.19) 

Multiplication of K in (4.15) by 4> k * and 4» k leads to K k » 4> k *K<J> k * (l+X*) ^-1 « 

seca k ~l. 

Trigonometric Operator Formulas for the T-iVaHhnod Functional 
Result 4.7 The log- likelihood functional can be expressed as 

J(0?y) - Tr log [SEC^eiJ ♦ */. z*(0)z (0) - z*(0) y, (4.20) 


where 
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*(9) > (I-C0Sa(6))y + COSa(9)m(0), 


(4.21) 


with COSa(0) * [UR(0))~^* * Z cosa^^* (4.22) 

Proof: Recognize that (4.17) is^lies that 

L(0) - I-COSa(0) and K(0) * SBCa(0) - 1, (4.23) 

and use these identities in (4.13) and (4.14) to obtain (4.20) and (4.21) respectively. 
Result 4.8 The negative-log livelihood functional can be expressed as 

J(0;y) » Z [log seca^O) ♦ V. z£(0) - ^(0^(0)!, (4.24) 

where and y k are the “spectral" coefficients 

z fc (0) = 4> k *(0)z(0), y k (0) * (4.25) 


and as before o k = tan x \^,, with being the eigenvalues of R. 

Proof: This result, which is closely related to Result 4.4 above, can be established by 

observing that SECa(0), z(9) and y in (4.20) can be ezp a n d ed as 

SECa(0) = £ seco^ * * Z 7 * Z (4.26) 


Selection of Preferred Formr * 1 * for S earch ha bmuutiop 


In principle, all of the above formulas for the likelihood functional J (©jy> can be used as 
a point of departure to compute the gradient dJ/39 and the corresponding Hessian 

d 2 J/d0 3 - and to thereby obtain the necessary ingtedients to implement the 
Newton- Raphson search for optimisation. The calculations involved in the numerical 
search can vary significantly, however, depending on which of the forms is used as a 
starting point. It is therefore of interest to conduct a detailed investigation of the 
relative advantages and disadvantages of the various methods to implement the search 
that arise from the various forms of the likelihood functional. Such an investigation is 
currently in progress and will be reported on in future work. In this paper however, the 
formula selected to compute the gradient and Hessian is that based on the 
predicted- data- covariance square-root filter in (4.13). 


5. 


COVARIANCE BIGBNSYSTRM SENS! 


!U f i? 


TO SMAT.T. PAP AMRTRP CHANGES 


As a preliminary to the evaluation of 3J/30 and d s J/38 2 involved in the numerical 
search for optimization, it is necessary to conduct an analysis of the perturbations 6\ k 


and 64> k of the eigenvalues and eigenvectors of R • H<hBB*4>*H*, with respect to 
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variations 66 of the parameter distribution 6. Such an analysis will provide the 
mathematical tools that will be used in subsequent sections to evaluate dJ/66 and 

0 J j/ae 2 . 

By definition! and ^ are the nontrivial solutions of 

R (0)^(6) = X^e^e), (5.1) 

where the dependence on 6 of R, 4^ and Xj, has been explicit. The ultimate objective 

of this section is to develop analytical formulas for calculating the first-order 
perturbations and 6<j> k of and 4> k with respect to small changes 66 in the 

parameter distributions 6. 

Definition of 6X^, dX.^/36, 6d> k and 6d^/36 

It is assumed here that the Frechet differential [2] of X^, at 6 exfsts and that it can 
be computed by 

6\ k (6;66) * JdX k (6+Y6e)/dTl Y=0 , (5.2) 

where y is a scalar and 66 is an admissible perturbation of 6. Bquation (5.2) is actually 
the formula typically used for computation of the Gateaux differential. However! it is 
assumed here that both of these derivatives exist and coincide and that therefore (5.2) 
can be used to calculate the Frechet derivative. 

Since X k is Frechet differentiable (admittedly by assumption, as an investigation of 

the technical conditions required for differentiability is not within the scope of this 
paper), its differential 6\ k (6;£8) can be expressed as 

6\ k (6;66) - iaX k (6)/aej6e, (5.3) 

where c X k (6)/d6 is a bounded linear functional referred to as the Frechet derivative of 
\ k ai 6. The transformation 3\ k /38 can also be viewed as a function space gradient of 
\ k at 6. Similarly, the eigenvector differential 6$ k (d;£'') is defined as 

6q> k (6;66) - I3d> k (6;66)/36] 60. (5.4) 

where (3q> k (6)/38] is the Frechet derivative, assumed to be linear and bounded. 

Calculation of 6X^ and 3X k /36 

Recall that the $ k in (5.1) are orthonormal so that 
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0 , 


m*k. 


(5.5) 


4> k *4> k * 1 and \*4> m * 

Multiplication of (5.1) by $ k *and use of 4> k *4> k » 1 leads to 

X k * ♦r** 4 V <5.6) 

which can be taken as the point of departure for calculation of 5X> k and dX^/39. 

Result S.l The Frechet differential 6)^, (6:56) can be expressed as 

6\ k (6:5e) . -\J(6> lp k *(e)6A(©:6e)x k (0)), (5.7) 

where 6A(9;60) is the differential of A defined as . 


5A(6;50) » [dA(0+Yde>/dYl YxO (5.8) 

and p k and are the vectors defined as p k * \ k 1 <h*H^4> k and ■ X~ l <5B^r k in Sec. 2. 

Proof; Performance of a first-order perturbation on (5.6). and use of the condition 
4> k *6<J> k * 0 leads to 


6\ k = (2\ k ) _l <t> k *6R<t> k , (5.9) 

where 5R(6:56) =* (dR(©4Y50)/dYj evaluated at y * 0. However, since 5R * 
6(H4>BB*54>*)H*, then 

5S » H(64>)BB*4>*H* 4 H4>BB*(64>*)H*. (5.10) 

It can be observed from (5.10) that evaluation of 6$ is the central calculation 
required to determine 55. In order to simplify notation, without loss of generality, it 
has been assumed in arriving at (5.10) that B and H do not depend on 6. In most 
practical cases, this assumption is satisfied because the poorly known parameters occur 
in the operator A. 

To compute 6<h, as required by (5.10), recall that A(0)<b(9) • I, so that (6A)4> 4 
A(64>) s 0, and 

6<|) * -4>(6A)4>. (5.11) 

Substitution of (5.11) in (5.10) leads to 

6 R « - H4>(5A)4>BB*4>*H* - H4>BB*4>*(6A)*4>*H* (5.12) 

Multiplication by 4> k *(* )$ k results in 
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(5.13) 


= - <4> k *H<t»6A«l>BB*<t>*H*4> k ) - ($ k *H<DBB*$*)6A*(<D*H*4> k ). 

Finally, use of the definitions p fc = X” 1 <t>*H*4> k and = X” l <Mfcp k in (5.13), and 

substitution in (5.9), implies (5.7). In performing this last step, it has been assumed that 
A = A* is formally self-adjoint, a condition that is valid on most problems of practical 
interest. 

Discussion and Additional Assumptions on A 

The above result, although a step in the right direction, is still somewhat 
intermediate because the differential 6X k in (5.7) is expressed in terms of the yet to be 

determined differential 6A. To proceed further, it is convenient to make two additional 
assumptions (typically satisfied in practice): 

• A(0) is linear in 6 so that A(0 ^ + 0^) * A(0 j) ♦ A(0)^ for two admissible 
distributions 0^ and 0^. 

• A(0) can be factored as A(0) * D*(0)D, where D and its corresponding formal 
adjoint D* may in general be matrix differential operators. 

Based on these assumptions, it is now possible to derive the following more explicit 
formulas for 6X fc and 3X^/3©. 

Result 5.2 The Frechet derivative 3^ (0)/30 of ^ is 


3X k (0)/30 = Xj Dp k (0) • Dx k (0). (5.14) 

Proof: Since A has been assumed to be linear and factorable 

6X k = -X^p k *D*(60)Dx k - Xj <(Dp k ) 60 (Dx^ (5.15) 

where the last equality is a consequence of a process analogous to integration by parts. 

Result 5.3 Since 3X^/30 has been assumed to be a bounded linear functional in X, 
it must be expressible as 

[3X k (0)/30]60 = <3X k (0; • )/30, 60> x , (5.16) 

where 3X^_(0; • )/30 is an element of X*(Q). Futhermore, I3X k (0{ *)/30] can be evaluated 
from 

3X k (0,x)/30 * X“ Dp k (0;x) • Dx^Oix). (5.17) 
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Proof : The rigorous derivation of this result is not as yet available. The result is 

accepted somewhat formally on the basis that a bounded linear functional can be 
represented by an element in the dual to the space in which the functional is defined. 

Calculation of 6<b^ and 3d>^/88 

Result 5.4 The Frechet differential 64 ^ (0»60) of 4^, can be expressed as 

6<|> v (6;6e) = £ ((d> * 6 R 4 >.)/a* - \*)] A , (5.18) 

if m k c m * m 

m*k 

where 6R is the differential of the observed- state covariance operator R. 

Proof : Since R4> k = X^4>^, 


(6Ei<p k + R64>j. = * \j 64 y (5.19) 

Now, seek an expansion for 6$. in terms of the orthonormal basis $ , i.e., 

K m 

‘-V E 'tanV 'tan ■ ts m 

m*k 

where c. are scalar coefficients to be determined. Note that the orthonormality of 
km 

4> k implies that c^ = 0, so that 6$ k does not have a component in the direction of 4> k - 
To evaluate c^, premultiply (5.20) by 4> m * to obtain 

O^* 6 **™ ♦ 4> C5.21) 

m mm K K m x 


Use of the conditions $ *R 

m 

leads to 


\ 2 4> * and c. . 4> *64>. and rearrangement of terms 

id in cm m c 


c km ‘ < V 4t V /,X i - X m'- <5 ' 22) 

Substitution of (5.22) in (5.20) leads to (5.18), thereby establishing the result. 

Equation (5.18) is similar in nature to (5.9) in that it expresses the desired 
differential in terms of the yet to be determined quantity 6R. 

Result 5.5 The Frechet differential 64 ^, (0$60) of 4 ^ can be expressed as 

^<6,69) . I I Vn. ,(X i- X m >1 'Vm* (4A) H 4 l „V (4A > W (S - 2S > 

m#k 
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where p fc = tod = X~ l OB# k . 

Proof: Substitute (5.12) in (5.18) end use the definitions for end x^. 

Equation (5.23) is valid without making the additional assumption that A(9) is 
linear in 6 and factorizable as A(6) = D*(0)D. If these two assumptions are now made, 
the following result can be obtained. 

Result 5.6 The Frechet derivative 3^, (0)/30 is specified by 


at> k (0)/ae= z 


I lK k Dp m - Dp k M» : 


m 


m*k 

Proof : This result follows by substitution of 6A(0) * D^fdOlD in (5.23). 

Closely related to <$4> k is the differential 

6(4>k4>k*) = * (6 ^ k ^ k * 


(5.24) 


(5.25) 


of the outer product The corresponding Frechet derivative 3(<p k 4> k *)/30 is 

evaluated in the following result. 

Result 5.7 The Frechet derivative [3<4^ 4^, *)/30] is specified by 


aMyt^l/W - I IX,. [X k Dp m - . X m Dx m - Dp,J I* m v * Vm*>- 

m*k (5 26) 


Proof: Use (5.24) to evaluate the right side of (5.25) and recall that * 

ld(<t> k <i> k *)/d0]60. 


Discussion 


The results obtained above provide the key tools required to evaluate the 

function-space gradient 3J/30 and Hessian 3 a J/30 a of the likelihood functional. The 
most useful formulas are (5.17) for the derivative 3X k /d0 of the eigenvalue \ k> (5.24) 

for the derivative 3q> k /39 of the eigenvector dy and (5.26) for the derivative 

3(4 ) k<t>k*V30 of the outer product ($ 6 *). These formulas will be used repeatedly lr. 

the following section. 
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6. SPBCTRAL REPRESENTATIONS FOR THE GRADIENT. APP POTTMATR husstam. 
AND NEWTON- RAPHSON SEARCH 

Implementation of the modified Newton- Raphson search for the optimal parameter 
estimates requires calculation of the gradient 3J/30 and of an approximation to the 

Hess 'l operator 3 a J/30 a . These calculations are best achieved nalng the 
predicted-data- covariance soua re-root filter in Result 4.5 that expresses the likelihood 
functional as 

J(6 } y) = Tr log [UK(0)] 4 V. z*(0)z(0) - z*(0>y, (6.1) 

where z(6) = L(6>y 4 [I-L(8>] m(0). Function space differentiation of (6.1) with respect 
to 0 leads to the gradient functional 

g(0;y) = 3I(0;y)/a0 = Tr (OL/30) (UK)) 4 (t-y)*<dz/30), (6.2) 

and to the approximate Hessian operator 

M(0;y) = Tr IOL/30) (UR) (3L/30)) 4 (3z/30)*(dz/d0), (6.3) 

upon which the Newton- Raphson numerical search is to be based. An updated estimate 
0 n+ 1 = 0 n - 60 n is obtained by specification of the parameter change 66° defined as 

60 n = M~ 1 (0 n ;y)g(0 n ;y). (6.4) 

The main objective of this section is to replace the operator s , arions (6.2) and (6.3) 
with a set of equivalent matrix equations more convenient calculations. The 
fundamental approach to be used consists of representing the ^unction space derivatives 
3L/30, 3m/38 and dz/80 - which have only been derived in terms of operator symbols in 
(6.2) and (6.3) - in terms of e specific orthonormal basis defined by the eigenvectors 

of the observed- state covariance operator R. 

Spectral Representation for the Gradient 

Result 6.1 The Frechet derivative 3L/30 of the predicted-data- covariance 
square-root filter L can be represented as 

3L/30 = Z Z ‘kmVm*’ (6 * 5) 

k m 

where the spectral coefficients a ^^ ■ $j,*(3L/30)4> m are specified by 


4 kk = ,ln ‘*k Dp k* °V 


( 6 . 6 ) 


A km * ^m^rn "^ 1 Ico,a k * co,a m 1 ^k^m* ' X m r "‘m^k 1 (67) 
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Note that a^ m defines a matrix whose diagonal elements are provided by (6.6) and 
whose corresponding nondiagonal elements are given by (6.7). 

Proof : Observe L = £ (l-coscOq^* implies 

3L/ae * £ {sina i (da.JdO'kpfl* - cota { Of^.^/aej}. (6.8) 

Substitution of this equation in a ^ = <J> k *(3L/36)d> m and use of orthonormality of d> k 
lead to 

»y, = sina^ cos^ql^ (3X^/3©), u. ^ m ■ -eota ^*0^ m /d8) - cosa k (3# k */36)<|> m , (6.9) 

where dX^/39 and 3# k /36 are the function-space derivatives evaluated in (5.18) and 

(5.25). Substitution of these two equations from 'Sec. 5 in (6.9) leads to (6.6) and (6.7) 
thereby establishing the result. 

Result 6.2 The Prechet derivative dm/36 of the suspected mean m(6) is represented by 

dm/36 ■ £ (3m/30) k 4> k , (6.10) 

with the spectral coefficients (3m/30) k specified by 

(dm/36). * \ k (Dp k * D<t>Cf), (6.11) 

and <t>Cf in (6.11) denoting the suspected value of the state u. 

Proof : Since m » HOCf, then 6m » H6d>Cf > -H<hA(/9) <bCf, where the last equality 

follows from the condition 5<h * -<t>A(60)d>. Define now (6m) k as the spectral 
coefficient of 6m, i.e., 

(6m) k * 4> k * 6m - -4> k * HOA(60)<t>Cf - -Xjp^AideWCf, (6.12) 

where as before p k = \ k l O*H*$ k . Use of the identity p k *A(66)<9Cf «. -Dp k *D(<bCf)66 
in (6.12) results in (6m) k ■ (3m/36) k 6fc. with (3m/36) k given by (6.11). 

R esult 6.8 In the special case in which the deterministic input f is assumed to be a 

vector f = (f . _] of M inputs applied at the discrete locations £., an alternative to 

x In i 

(6.11) in evaluating (3m/36) k is 

M 

(3m/36) k - £ \ Dp k (x) *Dq>(x/C m )fm for k “ 1 N * <6.13) 

m-1 
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where 4>(x/£) is the Green’s function of the system model operator A. 

Result 6.d The gradient 3z/30 = OL/30)y+ (I-L) (dm/3©) of the filtered state esti: oate z 
can be represented as 

3z/3© = E (3Z/36) k 4> k , (6.14) 

where the spectral coefficients (3z/30> k * $ k * (3z/36) are given by 

(3z/30) k = E * km (x) y m + £ b km (x)f m’ (615) 

m=l ouT 

with a t specified in (6.6) and (6.7) and 

b. (x) * sin a. Dp. (x) *D4>(x/£ ). (6.16) 

km k k m 

Proof: Substitute 3L/36 and 3m/3© irom (6.5) and (6.10) into 3z/3€* * (3L/30)y + 

(I-L) (dm/30) and then compute the spectral coefficients (3z/36) k in 'C. 14) from 

(3z/36) k = <t> k *(3z/36). 

Result 6.5 The gradient g(0;y) in (6.2) can be represented as 

g(0;y) = E Isin a a k tana k (Dpj^'DXj^) - e k (3z/3©) k J, (6.17) 

where = 4> k *e are the spectral coefficients of the residual process e » y-z, and 
(3Z/36) k are given by (6.15). 

Proof: Substitute 3L/36 in (6.5), dz/36 in (6.14), e » E e <b and I + K * 

£ seca k <|> k 4> k * into (6.2) and use orthonormality of <)> k . 

Equation (6.17) provides the means to evaluate the likelihood functional gradient, one of 
the key ingredients of the Newton- Raphson iteration. The approximate Hessian 
operator M(6;y), which . the other major element required to implement the search, is 
evaluated below. 

Evaluation and Inversio n of the Approximate Hessian 

Result 6.6 The approximate Hessian M(9ty) in (6.3) is an integral operator whose kerne) 
M(x/£) is specified by 

M(x/f) » E (sec*^ a^Ix) a^f) ♦ z' k (x)z' k (f)l, (6.18) 
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where z' k = (dz/30)^ = <p^*(dz/dQ) is the k 1 * 1 spectra) coefficient "f ?z/d9. 

Proof: Substitute (2.26) and (6.14) into (6.3) and use the orthonorrraiity of 

Implementation cf an iteration step in the Newton- Haphson search requires calculation 

of 60 n = M 1 (0 n iy)g(0 D ;y), representing the incremental change in the parameter 

estimate. Inversion of M(O n ;y) is therefore required at every step of the .earch. This 
inversion is achieved by solving an integral equation as outlined in the following result. 

Result 6.7 The incremental parameter change 6 0 n can be computed as the solution 
of the following integral equation 

S Q M n (x/O £3 n (£)d£ = g n (x), (6.19) 

where is the approximate Hessian ‘ emel in (6.18), and g^(x) is the value of the 
gradient at the spatial location x. The subscript n in M q and g Q denotes that the 

corresponding quantities are evaluated at the parameter estimate 0=Q n . 

Proof: Observe that 60 n = M.~ l g implies M n 60 n = g Q , and express this last equation 

in term 1 ! of the kernel M to obtain (6.19). 

n 

7. PARAMETER ESTIMATION ERROR, CSAMES-RAO BOUNDS AND OPTIMAL 
INPUT DESIGN 

The objectives here are to obtain a C-R bound for the covariance of t ! '.e parameter 
estimation error and to begin an investigation of the problem of optimal input design by 
using the C-R bound as a criterion for optimal input selection. 

Recall that the covariance of an unbiased estimate 0 satisfies the inequalitv 

E(0 0 ♦) i M _1 (0 ), (7.1) 

P P o 

where M(0 q ) L. the information operator uc lined as 

M(0 q ) = Eia a J/30 a ] ese - BIOJ/30) O?/30)*j o _ o _ (7.2) 

o o 

The corresponding mean-square estimation erro*- E(0 p *0 p ) sitis**^-? related 

inequality 


E(C p *0p) i, Tr(M x (© 0 » 


(7.3) 
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It can be observed that the key calculation required to obtain the C-R bound is the 
computation of E(3 2 J/80 2 ] as outlined below. 

Cramt^-Rao Bound for the Estimation Error 


Result 7.1 The information operator M(0 q ) is specified by 

M<e > = Eid^/ae 2 ]^ a = 2 TriOL/a©) a+R) OL*/ae>i 
o 0=© 

(7.4) 

+ (dm*/30) a-L*> a-L) Om/ae), 

where K = Hd>EE*d>*H % ' is the data-covariance operator, OL/30) is the derivative of 
L = I - (I+R) , and (dm/dO) is the derivative of the data mean m=H<DCf. 

Proof: Differentiate g(6;y) in (6.2) to obtain 

a 2 i/ae 2 = Trio 2 L/oe 2 vi+K) + caL/aenaK/ae)] + (z-y)*Oz/ae) 4 Oz/aewaz/ae). (7.5) 
Take the expected ’ xe in (7.5) abote, evaluate at 0 = 0 Q , and simplify to obtain 

E[a 2 J/ae 2 ' a = Tr i(3L/a0) (I+R) OL/30)’ + E[Oz*/a0) Oz/30)J. (7.6) 

y=y o 

Finally, use ' >) in (7.6) to arrive at (7.4). 

Result 7.2 In spectral form, the information operator M(0 q ) is specified by 

M(0 ) * £ (2 sec 2 a^ a^Ix) a^ff) * cos 2 Oj,m'j c (x)m , ^(f)J, (7.7) 

where a^ and m'^ = (Sm/dQ)^ are defined in (6.9) and (6.11) respectively. 

Proof : Use an approach similar to that used to arrive at (6.18). 

Inspection of (7.4) reveals chat the informatior operator M(0 o ) consists of the sum of 

two terms bcth of which are positive definite. In the first term, the data-covariance 
operator (I+R) appears as a "weighting” that is multiplied by the sensitivity filter 
dL/d9. Note parenthetically that in fart L is self-adjoint so that L » L*. The second 
term, on the other hand, will be shown to be a quadratic function of the input f. 

n esult 7.S Assume that f * ,...,L^] is a vector cf M inputs applied at the M discrete 

locations £ . T!ie information operator M(0 Q ) is an integral operator whose kernel 
M(x/£) can be e -.pressed as 


3f\ 



(7.8) 


M(x/f ) = U(x/f) + f T V(x/r'f, 

where 


U(x/£)=£ sia 4 a k Un 2 a k [Pp k (x)*rp k (x)J[Pp k (C)-Dx k (f)]. (7.9) 

k 

V(x/£)* E sin 2 ^ b k (x) b k (?), (7.10) 

k 

T 

and where t> k (£) is the M- dimensional vector 

b k (£) = |Dp k (£)-D<KC/f j) Dp k (f)*D4>(C/C^J, (7.11) 


with 4> being the Green’s function of A in (1.1). 

Proof: Substitute the eigensystem expansions for R in (2.26). for L in (4.17), for 

OL/30) in (6.5), and for dm/d6 in (6.10) into (7.4) to obtain (7.9) and (7.10). 

The second term in (7.8) is a quadratic form in the input signal f. This property can be 
used as a basis for optimal input design. 

Optimal Input Design 

The information operator can be used to state criteria for optimal input design. 
While several possible criteria exist, the one that is easiest to use is perhaps the 
maximization of Tr M(9 ): 


max Tr M(0 ) » U + f T Vf, f T f =* I , (7.12) 

o 

where 

U = J Q U'x/x)dx and V - / fi V(x/x)dx. (7.13) 

The optimal input f Q . which is the solution to the above optimization piob’em, is the 
eigenvector corresponding to the largest eigenvalue of the M-by-M matrix V. 


Other criteria for optimal input selection include: minimization of Tr (M l ), which 
would correspond to minimizing the Cramer- Rao bound; and minimization of 

X (M -1 ), where X is the maximum eigenvalue of M~ l . While these last two 

may 

criteria could be superior to (7.12), they both have the disadvantage of requiring 

inversion of the operator M( ). However, the requirement for such an inversion may 

o 

not be a serious additional drawback because a similar calculation is required to 
implement the Newton- Raphson search outlined in the previous sections. 


355 



Vanishing Bias of the Gradient 


Closely related to the above analysis is an investigation of the bias in the 
parameter estimate 9. The central result is as follows. 

Result 7.4 The expected value of the gradient functional g(9;y) vanishes at 9 * 0 q , i.e., 

Elg<9;y)J | 0=0 =0. (7.14) 

o 

Proof : Observ' that dz/30 = OL/89)y + (I-L) (dm/30), and recall that y = (I+K)e. 

Substitute this in (6.2) and take the ejected value. Finally, use the whiteness of the 
residual process, to be established in (8.46). 

8. FILTERING, SMOOTHING AND THB RESIDUAL PROCESS 

The central aim of this section is to con net an analysis of the smoothed estimate 
u q and of the filtered state estimate z q that emerges from the 

predicted-data-covariance square-root filter. This analysis leads to the following 
major results: 

• The smoothed estimate u q is optimal in a conditional mean sense. 

• The formulas that generate u q and have a predictor-corrector structure in 

which the final state estimate is the sum of: a prediction term-based on 
application of knewn inputs to the system model; and a correction term based 
on the difference between the actual and predicted data. The key element in 
these formulas is an estimator gain that provides the relative weighting 
between the two terms. 

• The covariance of the state estimation error inherent in both estimates can 
be evaluated by means of equations which, if written in operator notation, 
resemble those encountered in filtering and smoothing for linear dynamical 
systems. 

• Investigation of a residual process associated with the filtered state estimate 
z that has properties nearly identical to those of an innovations process: the 

C 

residuals are a white noise process with a unit covariance; the residuals and 
the measurements can be obtained from each other by means of reciprocal 
linear transformations. Because these transformation; *re not causal, the 
residuals are not a bona fide innovations process. However, they are as 
useful in deriving filtering, smoothing and identification solutions for elliptic 
syst .ms as the innovations process is in deriving similar solutions for linear 
dynamical systems. 
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• Development of relationships between the filtering and smoothing estimates 
that can be thought of as extensions to elliptic systems of the 
forward/backward sweep method for solution of filtering and smoothing 
problems in linear dynamical systems. 

• Development oi spectral representations for the predicted-data- covariance 
square-root filter and the optimal smoother in terms of the eigensystem of 
the state covariance R » <t>BB*<b*. This leads to simple ways to implement 
filtering and smoothing solutions on a computer. 

Smoothed and Filtered Estimates 


The smoothed and filtered state estimates u and * have been defined in (1.17) as 

o o 

u o = <t>Cf 4 G(y-H<J»Cf), Z Q « <*>Cf 4 g(y-H$Cf), (8.1) 

where G and g are Kalman- like gains specified by 

G = Z sin a a^x^4>j,*, g * £ (l-cosa^,) x^4>^. (8.2) 

The estimate u is referred to as a smoothed estimate because it is the 
o 

minimum- variance estimate of the state given the entire data set. This is established 
by the following result. 

Result 8.1 The smoothed estimate u in (8.1) is the conditional mean u _ « B(u/y) of the 

Q O 

state given the data. Furthermore, the estimator gain G in (8.2) can be expressed 
alternatively as 

G = RH*(I 4 HRH*)“\ (8.3) 

in terms of the state covariance R * 

Proof : Recall the general formuk. 

B(u/v) * E(uv*) [E(w*)] -1 v (8.4) 

derived in [4] for the conditional expected value of a zero-mean random process u given 
the related zero-mean random process v. Note that this formula requires calculation of 
the “cross-covariance" operator B(uv*) and the auto-coV4riance operator B(vv*). 
Define now the mean- centered state u * u~d>Cf « d>Bo and the mean-centered data y » 
Hu 4 n. By this definition, u and y are zero-mean. Therefore (8.4) can be used directly 
to compute u o = B(u/y), i.e., 


u o E(u/y) • E(uy*) lE(yy*)J % 


(8.5) 


357 


which indicates that to evaluate U qI it is necessary to first evaluate the covariance 

operators E(uy*) and B(yy*). These calculations are: B(uy*) » E(OB6)<j*B*<h*) = 

4>BB*<t>* and B(yy*) = E [(Hu+n) (Hu+n)*] * I * HfiH* Use of this in (8.5) leads to 

E(u/y) = Gy. (8.6) 

This together with the definition of V and J in terms of u and y implies (8.1). The 
equivalence between the two different expressions for G in (8.2) and (8.8) is established 
by use of the spectral expansions in Sec. 2. hi particular, use expansions (2.46) - (2.47) 
for I + R and the definition for Xj, in (2.31). 

As established by this result, the estimate u q has a very well defined probabilistic 

interpretation. It is not presently known if the utered estimate has a similar 

interpretation. Nonetheless, this estimate plays a very significant role in the filtering, 
smoothing and identification methodology for elliptic systems under development here. 
Its role is analogous to that of the filtered estimate emerging from a Kalman filter in 
the case of dynamical systems. This is further investigated below. 

Predictor-Corrector Structure 


To examine this structure, cmsider the equation for u q in (8.1) and illustrated in 

Fig. 8.1. Use of the deterministic input f^ and the system model <t»C^ leads to a 

predicted estimate The difference process y-H<t>Cf ^ is then formed and operated 

on by the estimator gain G ^ to obtain the correction term G(y-H<bCf) Firally, 
the correction term is added to the predicted estimate to obtain the optimal estimate 
u q . The equation for the filtered estimate. z q in (8.1) also has a predictor-corrector 

structure. The key difference between the twe equations in (8.1) is that the estimator 
gains are different. A relationship be' ween these two different gains G and g is 
explored later in this section. 


SYSTEM MODEL 



Fig. 8.1 Predfctor- Corrector Form of the Smoothed State Estimator 
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Estimation Brror Covtmnce ami KahnaiMifce Gains: Smoothing 


Since u q and x q are only estimates of the actual state u, it is of interest to 

investigate the inherent estimation error u * u-u and z » u-z . In particular, the 

p o p o 

aim is to determine the estimation error covariance, under the assumption that the 
actual model errors o and n in (1.1) and (1.2) are white-noise processes. 

Result 8.2 The covariance P * P* * E(u u *) of the state estimation error u = 


— P P P 

u-u q is specified by the following alternative formulas: 

P = (I - GH)R(I - GH)* + GG*. (8.7) 

V = R - RH* (I + HRH*) -1 HR, (8.8) 

P = ( I - GH)R = R(I - GH)*, (8.9) 

P = $B (I + B*<t>*H J 'H<t>B)~ 1 B*<b*. (8.10) 


Proof: To show (8.7), observe that u « d>Cf + <bB» and u q - <bCf + G(y - H<bCf) 

imply that u = u - u is 
P o 

u = (I - GH)<bB» - Gn. (8.11) 

P 

Hence B(uu p *) = E['T-GH)<t>Bu(j*B*<i>* (I-GH)* + Gnn*G*] » (I-GH)R(I-OH)* + GG*. 

where use has been made of the fact that e » [u,n] is a white-noise process with 
covariance E(ee*) * I. To show (8.8), observe that (8.7) implies 

P = R - GHR - RH*G* ♦ G( 4 HRH*)G*. (8.1.) 

Substitution of G = RH* (I + HfH*) -1 in (8.12) leads to (8.8). To show (8.9). observe 

that (8.8) can be expressed as P - R (I-GH)* « (I-GH) 8 by using G - RH*(I ♦ HRH*) -1 
in the last two terms of (8.12). To establish (8.10) , substitute 5 ■ <t>BB*<t>* in (8.8) and 

use the identities B*d>*H* (J+H<J>BB*<t>*H*) _1 HC>B - (I+B*<&*H*H<&B) -1 B*<!>*H*H<I>B - 1 - 

(1+ B*<t>*H*H<J>B) — 1 . 

Result 8.3 The operator HPH* is the Fredholm resolvent of HRH* so that 

(I + HRH*) -1 - I-HPH*. (8.13) 

Proof: Compute HPH* from P in (7.8) to obtain HPH* - HSH* [I - (1+HRH*) -1 ]. Use 

the identity (I4HRH*) -1 HRH* » I - (I+HRH*) -1 twice in this Ust equation to obtain 
(8.13). 

The aim now is to use (8.13) in (8.2) to obtain an alternative expression for the 
estimator gain. 
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Result 8.4 


The estimator gain G = RH*(I+HRH*) 1 can also be expressed as 
G = PH*, (8.14) 


where P = E(u u *) is the covariance of the smoothed state estimation error u . 

P P p 

Proof: RecaU that PH* = (I - GH)RH* = RH*II - (I + HRH*) -1 HRH*]. iince 

(I + HRH*) -1 HRH = I - (I + HRH*) -1 , then PH* = RH*(I + HRH*) -1 = G. 

Result 8.5 The mean-square smoothed state estimation error is given by 

E(u *u ) = Tr (P). (8.15) 

P P 

Proof: This follows from the definition of P as F = E(u u *). 

P P 

Note that many of the above formulas are very similar in form to the ones 
traditionally encountered in Kalman filtering for dynamical systems. For instance, Bqs. 
(8.3) and (8.14) are very similar to those used to compute the gain G for a Kalman filter 
in which R and P are the covariances of the estimation error associated with the 
predicted and _corrected state estimates. Note also that (8.8) implies that P is always 
smaller than R, which implies that the covariance of the estimation error after the 
observation y has been accounted for is smaller than the error covariance before the 
estimate correction occurs. 

Estimation Error Covariance and Kalman-like Gains: Filtering 

The aim here is to obtain results similar to results (8.2) - (8.5) above, but that are 
applicable to the filtered estimate z q . 

Result 8.6 The covariance E(z z *) of the filtered state estimation error z = u-z 
p p P o 

is given by 

E(z p z p *> = (I-gH) R(I-gH)* + gg*. (8.16) 

where R = <t>BB*4>* is the state covariance, and g is the filter gain in (8.2). 

Proof : Note that u = <t>Cf + <J>Bu. This and (8.1) imply that 

z p = (I-gH) <t>Bo-gn, (8.17) 

where use has been made of v-Hd>Cf = H<t>Bu 4 n in (6.1). Calcul'tion of E(z z *), 

P P 

.... 17) and the conditions E(ou*) = I and E(nn*) * I, leads to (8.16). 

This result applicable to the filtered estimate is analogous to (8.7) of the smoothed 
estimates. To obtain results that are analogous to (8.8) - (8.10) requires, howover, a 
preliminary definitions and results. The need for these preliminaries arises from 
-■t jltimate desire to find a spectral decomposition for the state covariance R = 
•' It is straightforward to obtain the spectral representation for the 


360 


observed- stale covariance HRH*. However, finding a similar decomposition of R is not 
as simple. The primary reason for this lack of simplicity is that the vectors ^ = 

X. may not necessarily span the entire space H. This is particularly true in 

cases in which the dimension of the input space is greater than the dimension of the 

observation space Hg. In order to consider this case, assume that the operator HOB has 

finite-dimensional range. This corresponds to the situation where there are only a 
finite number N of seniors and the observed- state covariance R = H0BB*0*H* is an 
N-by-N matrix. Assume also that the input space is either infinite-dimensional or 
finite-dimensional with dimension M greater than N. This second assumption 
corresponds tc cases where the uncertainty is distributed at M discrete locations or 
throughout the entire spatial domain Q. 

Result 8.7 The identity operator I mapping into itself can be decomposed as 

1 = 1 +1,, (8.18) 
o X 


where 


I =I-B*d>*H*R 1 Hd>B, I. <b*H*R l H<bB, (8.19) 

0 X 

and R = Hd>BB*<b*H*- is the observed-state covariance. In addition, I is in the 

o 

null- space of the operator 

R (•) = H<t>B( • )B*<t>»H*, (8.20) 

mapping the space of bounded linear transformations on HjX H^ into the space of 
N-by-N matrices. Furthermore, I Q and 1^ are orthogonal complements so that 

1 *1, = Tr (I IJ = 0. (8.21) 

0 X o X 

Proof : This result and its corresponding proof are illustrated graphically in Fig. 8.2. 

Eq (8.18) follows from (8.19). Substitution of I in (8.19) into (8.20) shows that R (I ) = 0 

so that I is in the null space of R( • ). That I Q and 1^ are orthogonal complements 

follows from substitution of (8.19) in (8.21) by calculation of Tr [I Q 1^] using (8.19). 

Resul 8.8 The i* • • y operator I mapping H^ into itself can be expressed as 

N 

1 * I Q ♦ £ <8.22) 

)-l 
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SPACE OF BOUNDED LINEAR TRANSFORMATIONS 
FROM INPUT SPACE INTO ITSELF 



Fig. 8.2 Orthogonal Complement Decomposition of the Identity in Hj z H^. 

Proof : Substitute R = into I^in (8.19) and use tpj « X~ x B*<b*H*<Pj. 

The above result simply reflects the fact that the ^ do not span H^, because (by 

assumption) there are only a finite number of them, and this number is smaller than the 
dimension of the input space. 

Result 8.9 The state covariance R « $BB*$* can be decomposed as 
N 

R * R q 4 £ X* x j* *• (8.23) 

M 

whirl* 

R « <t>Bl B*<t * (8.24) 

o o 

Furthermore, 

HR H* = 0, HR » 0, R H* * 0. (8.25) 

O 0 0 

Proof : To show (8.23), substitute I from (8.22) into 5(1) - <&B(DB*<t>*, and use x^ » 

X _1 <bB4r , . To show (8.25), substitute I from (8.19) into (8.24) and (8.25). 

) j o 
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Result 8.10 


The dual state covariance Q = d>*H*H<J> can be expressed as 


N 

Q = Z X? PjPj* (8.26) 

)=1 

where p. = - 

J ) J 

N 

Proof : Since the <f>. span the observation space H„ = R , th-a 

) 

N 

I N = Z 4)^* (8.27) 

)=1 

N N N 

v;here I denotes the identity in R x R . To obtain (8.26), substitute (8.27) in Q = 
d>*H*I N H<J> and use definition of p.. 

Define now the quantities 

N N 

r = (*/«) R Q + E (seca.-l) x.x.*, <1 = Z (seca.-l) p.p.* (8.28) 

}=1 )=1 

and note the following key identities. 

Result 8.11 The state covariance R = <DBB*<b* and r defined in (8.28) are related by 
R = r + r*+rH*Hr. (8.29) 

Furthermore, 

I + HRH* = (J+HrH*) (I+Hr*H*). (8.30) 

Proof : Substitute r from (8.28) and R from (8.23) into (8.29). Use the orthonormaliiy 

of x. with respect to H*H. This establishes (8.29). Equation (8.30) follows from (8.29) 

by forming I+HRH* from (8.29) and rearranging terms. 

Result 8.12 The dual state covariance Q in (8.26) and q 'n ,o.28) satisfy the identity 
Q = q+q* + qBB*q*. (8.3 D 

Furthermore, 

I + B*QB = (l+B*qB) (I+B*qB). (8.32) 
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Proof : Substitute Q in £3.26) end q in (8.28) into (8.31). Use the orthonormality of p^ 

with respect to BB*. This establishes (8.31). To establish (8.32), form I + B*QB using 
(8.31. and rearrange terms in the resulting equation. 

These are the preliminary results needed to evaluate the covariance of the estimation 

error associated with the filtered state estimate z . 

o 

Result 8.13 Th: filter gain g defined in (8.2) can be expresssed alternatively as 


g = rH* (I*HrH*) \ 
where r is defined in (8.28). 


(8.33) 


Proof: Substitute r from (8.28) into (8.33) and use R H* 

O 

recovers g in (8.2). 


0 and <J>.* 


x.*H* This 


Note the similarity between (8.3) and (8.33). The equation in (8.3) expresses the 
smoother gain G in terms of the state covariance R *_<PBB*<b. Eq. (8.33) is a similar 
equation for the filter gain in terms of r. The operator R in G can be interpreted as the 
state covariance. No similar piobalistic interpretation for r is known. However, its 
introduction is very useful because it allows development of formulas fo: the estimation 
error covariance and for the filter gain that very closely resemble those obtained for 
the smoothing solutions. 

Result 8.14 The covariance E(z z *) of the filtered state estimation error z * u-z 


E(z z *) * p + p*, (8.34) 

P P 

where p = p* is specified by the alternative formulas 

p = (l-gH)r(I-gH)* + gg* (8.35) 

p = r-rH* (I+HrH*) -1 ^, (8.36) 

p = (I-gH)r = r (I-gH)*, (8.3?) 

p = ( l /j) R q + £ (l-cos<ij) (8.38) 

Proof : To establish (8.34) and (8.35), substitute (8.29) in (8.16) and use the identity 

(I-gH) rH* * rH* d + HrH*)“ x * g. (8.39) 

To establish (8.36 ), observe that (8.35) implies that 

p - r-gHr - rH*g* + g(I + HrH*)g*. *3.40) 
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Substitute g * rH* (I + HrH*) -1 in (8.40) to obtain (8.36). To obtain (8.37) observe that 
the second term of (8.36) can be expressed alternatively as gHr and rH's*. To obtain 
(8.38 ), substitute r in (8.28) into (8.36) and use orthonormality of 

Result 8. IS The mean-square estimation errcu associated with the filtered state 
estimate is given by 


N 

B(Zp*z p ) = Tr |p + p*] * TrIR Q ) + 2 £ (l-cosa^) (8.41) 

)*1 

Proof : This result follows from (8.34) and (8.38). 

Result 8.16 The filter gain g can be expressed as 

g * pH*. (8.42) 

where p is related to the filtered state estimation error covariance by B(z z *) » p + p*. 

P P 

Proof : Since g = rH* (I+HrH*)~\ then g « rH* (I- Kg)* - rH* (I-g*H*) = r 

(I H*g‘)K* = r (I-gH)*h* * pH*. 


This equation is analogous tc (8.14) in that it expresses an estimator gain in terms of 
the covariance of the state estimation error. 

Result 8.17 The operators I 4 HrH* and I -HpH* are reciprocal i. e., 

(I + HrH*)~ l = I - HpH*. (8.43) 

Proof : Recall (I 4 HrH*) -1 » I-Hg « I-HpH*, where the last equality holds because 

g * pH*. 


Note that this result implies that the operator HpH* is the Fredholm resolvent of 
HrH*. The identity also immediately implies whiteness of the residuals process as 
investigated in more detail below. 

Pseudo- Innovations Properties of the Residuals 

Define the residual process in the usual way. as the di/fcret between the actual 
measurements and the predicted data emerging from the pred-cted-data-covariance 
square-root filter. i.e.« 

e * y-Hz (8.44) 

o 

This process turns out to have two key properties that are tearly identical to those of 
an innovations process: the residuals are white with a unit covariance; the residuals 
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and the measurements can be obtained from each other by means of reciprocal 
relationships. These two properties are established in the following results. 


Result 8.18 The residual process defined in (8.44) is white with a unit covariance, 

i.e., 

E(ee*) = I. (8.45) 

Proo£: Observe from (8.1) that e * (I-Hg) (y-H$cf). Hence, B(ee*) * (I-Hg) (I. HRH*) 

(I-Hg)* = I. This last equality follows from E[(y-Hd>cft (y-H<bCf)*J * I + HRH* and from 
(8.42, and (8.43). 

Result 8.19 The residuals e « y-Hz Q and the mean-centered measurement process 

y = y-HOCf can be obtained from each other by means of reciprocal linear 
transformations, i.e., 

e = (I - HpH*) y, y « (I.HrH*)e (8.46) 


where 


(I + HrH*) 1 * (I- HpH*). (8.47) 

Proof : Eq. (8.47) has been established in (8.43) and is restated here only to 

emphasize its relationship to the properties of the residual process. Eq. (8.1) implies 
e = (I-Hg)y. 

Relationships Betw eea Filtered and Stp n.-»h>n< 

While the smoothed and filtered estimates have been defined somew’ott independently 
of each other, these estimates are in fact very closely related. It is possible to ' tain 
one in terms of the other, as outlined in the following result. 

Remit 8.20 The smoothed and filtereo estimates and v are related by 
0 0 

u * z + ge, (8.48) 

o o c 

where 

e - y-Hz q (8.49) 

is the residual process, and g is the predicted-data- ovariance square --out filter gain. 

Proof : Observe that (8.1) and (8,3) Imply u • 4>Cf ♦ RH* (I + HfiH*) 1 (y-H<DCf). 

Use of (8.46) leads to u - <t>Cf + RH* (I*HrH*)“ l e Similarly, z^ in (8.1) and - in (8.33) 
lead to z « d>Cf -v rH*e. Hence, u -z - [RH* (I+K.H*)" * -rH*]e. Use of the identity 

O 0 0 

(8.29) in this implies that u 0 * z 0 * ge, which is the desired result. Note that (8.48) can 
be *>,* + r n in the alternative form 
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(6.50) 


u = a-gH)z 4 gy, 
o o 07 

Closely related to the above relationship between filtered and smoothed st/»te estimates 
is a relationship between the corresponding covariances of the state estimation errors. 
This is developed below. 

Result 8. 21 The filtered state estimation e.-or z = u-z and the residual process 

P o 

e = y-Hz are related by 
o 


e = n 4 Hz p . (8.il) 

where n is the measurement error. 

Proof: Note that e = y-Hz » Hu4n-Hz = Hm-z ) -»n * Hz + a 

o o o p 

Result 8.22 The covariance P * E(u u *) of the smoothed stst? estimation error 

p p 

u = u-u can be expressed as 
p o 

P * p 4 p* - plI*Hp, (8.52) 

where p 4 p* * E(z z *) is the covariance of the filtered state estimation error z_ *- 
P P P 

u-z . Furthermore, 
o 

I-Hf H* . (I-HpH*) (I-HpH*), (8.53) 

Pro of: Use (8.52) to obtain 

E(ez *) = B(nz *) 4 HE(z z *) » B'nz *> 4 H(p4p*). (8.54) 

P P P P P 

Now use (8.17) to compute E(nz^*), i.e., 

E(nz *) * -g*. (8 57' 

? 

since E(nc>*) = 0 by assumption. Substitution of (8.55) in (8.54) and use of j » j u* !eads 
to 

t(ez p *) * g*. (8.56) 

Since u * u-u , the” u * z - ge from (8.48). Hence, 
p o P P 

E(u u *) * £(z z *) - g B(ez *) - E(z 4 gL(ee*)g*. (8.57) 

P P P P P P 

Now use .8.34), (8.43), (8 45) and (8.f6) to rbtaiu (8.52). Equation (8.53) follows 
immediate, W from (8.52) by forming I - HPH* are; rearranging terms in tl-i resulting 
expression Note that (8.52) implies that the gains C . id g aie related by 
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G • g ♦ g - gHg. 


(e .58) 


The last three results can be viewed as a generalization to elliptic systems of 
relationships encountered in filtering and smoothing for dynamical systems. For 
example* Equation (8.48) is a generalization to elliptic systems of the forward/backward 
sweep method for solution of two-point boundary-value problems. This method in 
general terms states that the smoothed states estimates can be obtained as a result of a 
two-stage process: forward filtering by means of a Kalman filter to obtain a filtered 
state estimate and a residual process; and backward smoothing to process the residuals 
and obtain a smoothed state estimate. This two-stage data processing approach has 
been extensively studied for linear dynamical systems. Bqs. (8.48) and (8.49) have 
exactly the same structure. This structure is illustrated in Fig. 8.3. 

The ovo. all diagram illustrates how the data y^ and the deterministic input f^ are 

(31 

processed to arrive at a smoothed estimate u l \ The estimation process consists of two 

o 

stages: a FILTERIN''' stage that results in a filtered estimate z^ and a residual 

o 

process^. This filtering stage is characterized by a predictor- corrector structure 

where a predicted estimate^ is first produced and then corrected by a correction 

term.^ The results of the filtering stage are then processed by the SMOOTHING 

[81 

stage. Central to both the filtering and smoothing stages is the gain g . The 
foregoing structure is nearly identical to that of the forward/backward sweep method in 
linear dynamical systems. There are* however, some key differences. One of the 
differences is that the filtering stage in the case of dynamical systems is based on the 
Kalman filter* whereas in the elliptic case under consideration here, this filter is 
replaced by the predicted-data-covariance square-root filter. Another key difference 
is that the Kalman filter is causal whereas the predicted-data covariance square-root 
filter is not, i.e., the filter gain g is a Fredholm operator as opposed to being a Volte rra 
operator. In the same vein, the smoothing stage for dynamical systems is backward (in 
time) or anticausal. In the elliptic system case, however, the smoothing stage is also 
characterized by Fredholm operators. The notion of causality is not even introduced 
here although it is possible to do this for certain classes of elliptic systems [1]. 


FILTERING 


SMOOTHING 


r 1 ~ " ’SMOOTHED 



Fig. 8.3 Filtering and Smoothing 
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Spectral Representations: Sm oothing. Filtering. and the Residuals 


The aims here are: to obtain spectral representations for the filtered and smoothed 
estimates u q and z q and the corresponding error covariances P and p; to explore the 

predictor- corrector structure of the spectral representations of the filter and 
smoother; and to investigate the pseudo-innovations properties of the spectral 
representation of the residual process. The term “spectral representation* means the 
use of an expansion in terms of the eigensystem <J>. of R and of the related functions 

x|r. = X. x.= Xp^Bifr. and p. = X7 l d>*H*q>.. 

Result 8.23 The smoothed state estimate u q can be represented as 


u = <PCf + £ sin a a. (y. - m.) X., (8.59) 

o ) i ) ) 

where ,, - *,*7 end m, • «,*m ere the W«trd compo«nu of the deu j end the 
suspected mean m = H<t>Cf. The related observed- state estimate Hu^ is specified by 

Hu = m + HG (y-m), Hu - (I-HG) m ♦ HGy. (8.60) 

o ' o 


In spectral form, Hu 

o 


l 


u 1 <p. where 

o ) 


u l * m. + sin a o. (y.-m,), 
o ) ) 1 “ 


cos ♦ sin ^y^, 


(8.61) 


and u* » <p.*Hu . Let u * u - u denote the estimation error. The error covariances 
0)0 p o 

F = B(u u *) and HFH* « B(u *H*Hu ) are represented by 
P P P P 


F - S o 4 £ sin 2 a. x. x * HPH* - £ ain^dypj*. (8.62) 

Furthermore, the corresponding mean-square estimation errors B(u p *n^) « Tr[P] and 

E f u *H*Hu ) = Tr [HPH*] are 
P P 

E(u *u ) * Tr [5 ] ♦ £ sin a a, x.*x., B(u *H*Hu ) « £ sin a a, (8.63) 

P P o'- I ) I P P I 

Proof : To establish ( 8.59) . substitute y ■ Ey^ and m « Em^ in (8.1). To show 

( 8.60) multiply u q in (8.1) by H and recall that m • H<PCf. To establish 8.61 . multiply 

(8.60) by The equation for P in ( 8.62) follows by substitution of (8.23) in (8.8) and 

use of the conditions HR * R H* * HR H* - 0. The equation for HPH* in (8.62) 

o o o 

follows frr ht P and use of * Hx } . Bq ( 8.63) follows from (8.62) and the orthonormality 
of <ty 
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Result 8.14 The filtered state estimate z^ can be represented by 

z = <bCf + £ (1-cosa.) (y.-m.) x., 
o ) I J I 

The related observed state estimate z = Hz is 

o 

z = m + Hg(y-m), z = (I-Hg) m + Hgy. 

In spectral form, z = £ z.<p. 


(8.64) 


(8.65) 


z. = m. ♦ (1-cosa.) (y.-m.), z. * cosa. m. + (1-cosa.; y.. 
) ) ) ) ) ) ) “ i ) 


( 8 . 66 ) 


Let z = z-z denote the estimation error. The estimation error covariances H(z z *) 


p + p* and E(HZpZ^*H*) = H(p 4 p*)H* can be represented as 


P P 


P = (*/*) R q 4 £ (l-cosa^) x.Xj*, HpH* * £ (1-cosa.) $.$.*. 


(8.67) 


Furthermore, the corresponding mean- square estimation errors are 


where 


E(z *z ) = Tr(p 4 p*) f B(z *H*Hz ) * Tr [H(p 4 p*)H*], 
P P P P 


( 8 . 68 ) 


Tr IpJ = (V«) Tr [RJ 4 £ (1-cosa.) x.*x., Tr [HpH*] . £ (1-cosa.). (8.69) 

Proof : To show ( 8.64) , substitute y * Ey ^ .and m * Em jj> jinto z Q in (8.1). Bq. ( 8.65) 

follows from multiplication of (8.64) by H and use of m * H<PCf. Eq. ( 8.66) is obtained 
from (8.65) upon multiplication by and use of the orthonormality of The 

equation for p in (8.67) has been established in (8.38) and is repeated here only for 
convenience. The second of Eq. ( 8.67) follows from use of the identity <bj * Hx^. Eq. 

( 8.68) follows from the definition of p * p* in (8.38). Bq. ( 8.69) is established by 
performing the trace operation on (8.67). 


Result 8.25 


The residual process e * y-Hz Q can be represented as 


e = £ e.<bj, e^ = <|>.*e. (8.70) 

The spectral components e. are independent random variables with zero-mean and unit 
covariance, i.e., 


B(e.e.) = 0 for i »tj, B(e*) * 1. 


(8.71) 
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Furthermore, the spectral components e^ and of the residual and difference processes 
e = y-Hz Q and y - y-m are related by the reciprocal relationships 


e. = cosa. y. f y. = seca. e.. (8.72) 

Proof ; Eq. ( 8.70) is valid because 4> .are orthonormal in H To show ( 8.71) , observe 

that E(e.e.) = <t>* 3(ee*) 4>. and then use (8.4S) and the orthonormality of 4>.. Equations 

(8.72) are the spectral representations of the reciprocal relationships (8.47). Note that 
(8.72) can also be established by the simple trigonometric identity (1/cosol) * seccr. 

9. NUMERICAL SEARCH CALCULATION SUMMARY 

Since the development of the estimation- approach is rather lengthy, it is 
convenient to summarize the steps that are required to implement the numerical search. 
It is assumed that the process starts with a known input f, a set of data y and an 

initial parameter estimate 0 n . To conduct an iteration in the numerical search requires 
that the following steps be performed: 

1. Compute the suspected mean and covariance m = H$Cf and R « Hd>BB*0*H*. 

2. Compute the eigenvalues and eigenvectors of R. 

3. Compute the related vectors p^ * = B*p^ and 

4. Conduct a spectral analysis of the data and of the suspected mean to obtain the 
spectral coefficients y^. = 4> k *y *nd m^ = 4>^.*m. 

5. Use Result 6.5 to evaluate the gradient dJ/36 of the likelihood functionaL 

6. Use Results 6.6 and 6.7 to compute the Hessian M q and to determine the 
incremental change 68 n of the parameter estimates. 

7. Obtain a new parameter estimate 0 n+ * * 0 n - 60 n , return to step 1 above, and 
iterate through steps 1 to 6 until convergence is achieved. 

If Cramer- R so bounds and/or an optimal input are desired use (7.6) - (7.13). If the 
covariance of the state estimation error is desired use Result 8.2 and/or 8.13. 

The calculations involved in conducting a single iteration in the maximum-likelihood 
parameter estimation approach are summarized in block diagram form in Fig. 9.1 A 
single iteration consists of all of the computational steps required to obtain an updated 

parameter estimate 0 n+1 by processing the available data, the known deterministic 

input, and the current parameter estimate © n . 


371 



wuAH-ioot mrut 



Fig. 9.1 Calculations Required for Single Iteration in Modified Newton- Saphson Search 

To simplify the description of these computations, the steps performed in a single 
iteration have been grouped into the following four major blocks (delineated by the 
broken lines in the diagram): 

• a SQUARE-ROOT FILTER block that processes the measurement data y and 
the external input f to obtain a filtered estimate z and a corresponding 
residual process e, defined as the difference between the data and the 
filtered state estimate. The square-root filter implements the equations z » 
Ly 4 (I-L)m and e « y-z. The central computation in the square-root filter 

-Vi 

block is that provided by the operator L » I - (UR) defined in terms of the 
square-root of the predicted-data- covariance (I ♦ R). This operator appears 
in two distinct places in the diagram: in the data filter , whose primary 
function is to process the measurements y; and in the mean filter, whose 
main function is to process the suspected mean m. The suspected mean is in 
turn obtained from the known external input by means of the input-output 
model. 
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• * SQUARE- ROOT FILTER SENSITIVITY block that processes the 
measurement data y and the deterministic input f to obtain the filtered 
estimate sensitivity dz/36. This block implements the equation dz/86 = 
OL/36)(y-m) 4 (I-LWdm/39). The computation of the sensitivity dL/36 is the 
main calculation performed in this block. 

• a GRADIENT-HESSIAN SYNTHESIS block that forms: the function-space 
gradient dj/d6 of the likelihood functional by means of the equation 3J/36 = 
TrtOL/dOXUK) - (3z/30)e*]s and the function-space approximate Hessian by 
means of the equation M - Tr [ OL/aOXU RXdL */80)] 4 Oz/30)*Oz/ae). Note 
that the quantity that is actually evaluated in this block is the kernel M (x/£) 
of the Hessian operator. This kernel is a function of two spatial variables x 
and C defined over a "square* domain (x/£)cfi x Q, where Q is as before the 
spatial domain of definition of the system model. 

• a NEWTON- RAPHSON ITERATION block whose input is the gradient and the 
approximate Hessian and that generates as an output the updated parameter 

distribution 8 n+ 1 for the next iteration. The central calculation in this block 

is the solution of the integral equation M Q 66 n » g Q that results in the 

parameter estimate update 66 n . 

After specification of the parameter estimate 0 n+ ^, the square-root filter L(6°) and its 

sensitivity dUQ^/dO are redesigned by letting 0 n -*0 n+l , and the steps outlined above 
are repeated in order to conduct the next step in the iterative process for optimization. 

The predicted-data- covariance square-root filter processes the data y and the 

suspected mean m to produce a filtered state estimate z and a set of residuals e ■ y-z. 

_i/ t 

This is done by means of the equation z > Ly 4 (I-Lftn, where L * i-(UR) . This 
equation, while providing a very succinct symbolic description of the square-root filter, 
does not by itself provide a recipe to conduct computations. In order to provide such a 
recipe, it is convenient to use the corresponding spectral form z^ * (l-coso^)y^ 4 

cosa^m^, which expresses the spectral amplitudes z^ « $^,*z of the filtered state 

estimate z as a linear combination of the data and suspected mean spectral amplitudes 
y^ and m^. Such a spectral form of the predicted-data- covariance square-root filter is 

illustrated in Fig. 9.2. 

The diagram in the figure illustrates the main calculations involved in the square-root 
filter. On the upper branch of the diagram, a set of data^ y *. Iy^,...»yj t j] is assumed to 

[2] 

be available at N discrete locations. A spectral analysis is conducted on this data to 
obtain the data spectral amplitudes^ ly l ,...,y N ]. These spectral amplitudes are 

then multiplied by the coefficients (l-cosa^) in the data filter^, resulting in the terms 

(l-cosa^)y*. On the lower branch of the diagram, the deterministic inputs f| 6 ^ are 


373 



processed by the input/output system model^ to obtain the suspected mean 
m * (m 1 ,...,m N J*- 8 l The spectral amplitudes m * m PI of the suspected mean are 

then computed and subsequently multiplied by the coefficients cosa^ in the mean 

filter^ to produce the terms cosa^m* ^ . This last term is then added to 

(l-cosa^ly* in^ resulting in the filtered state spectral amplitudes and the 

1131 

residuals . Note that the physical state estimate z and the residual e can be 
recovered from z^ and e^ by means of the summations z « Ez^4>^ and e = 
although for simplicity this last transformation is not shown on the diagram. 



Fig. 9.2 Spectral Form of Predicted-Data-Covariance Square-Root Filter 

The foregoing remarks have scrutinized the spectral form of the square-root filter 
equation z = Ly 4 (I- Dm. The immediate aim now is to conduct a similar detailed 
analysis of the spectral representation of the square-root filter sensitivity equation 
dz/dO « OL/36)y 4 il-D (dm/38). The spectral form of this equation is stated in Bq. 
(6.15) and illustrated in the block diagram in Pig. 9.3. The overall primary 
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function of the square-root filter sensitivity is to process the N mean-centered data 

spectral amplitudes^ and the M deterministic inputs^ in order to obtain the spectral 
(31 

amplitudes of of the filtered state estimate sensitivity dz/86. An intermediate 
calculation embedded within this overall process involves processing of the 

mean-centered data spectral amplitudes y*^ by means of the N-by-N matrix, with 

general elements a^, representing the data filter sensitivity 8L/30^. Other 

intermediate steps involve: processing of the deterministic inputs f ^ by the 

xn 

input/output model sensitivity matrix b km 151 to generate the suspected mean spectral 
amplitudes (dm/36)^ and subsequent processing of these coefficients by the mean 
filter ^ to obtain the terms cosa^ Om/30)^ 



Fig. 9.3 Spectral Form of Square- Root Filter Sensitivity 

10 . CONCLUDING RBMARKS AND FUTURB DIRECTIONS 

The a of estimation for elliptic systems is so full of interesting research 
problems that, in spite of all that this paper has covered, much more remains to be 
done. These are some of the problems that lie ahead: 

• Conduct of an asymptotic statistical property analysis that explores the 
convergence of the parameter estimates as the number of observations 
increases. 
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• Development of approximation approaches that rigorously arrive at 

finite-dimensional approximations to the infinite-dimensional solutions 
advanced here. 

• More complete investigation of the optimal input design problem. In 

particulari development of "spectral" domain design approaches which would 
do for elliptic systems what the frequency domain methods achieve for linear 
time - invariant dynamical systems. 

• Development of more precise mathematical arguments to Justify 

function- space differentiation, eigensystem expansions, covariance 

calculations, likelihood-ratio derivations, etc. 

• Investigation of alternative (to the square-root) factorization of the 

predicted-data- covariance that could result in easier calculation of the 
function- space derivatives necessary for the Newton- Raphson search. 

• Numerical experimentation with the filtering, smoothing and identification 
algorithms to gain further insight into the state and parameter estimation 
approaches and solutions [5]. 

As a final remark, this paper is a concrete example of the power of the functional 
analysis approach to estimation advanced in Ref. [4]. Because of the conceptual 
simplicity of the method, it has been possible to solve in this paper problems that would 
have defied solution by any other method. It has also made it possible to conceive areas 
for future research that would otherwise have been left unidentified. 
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I. INTRODUCTION 


Many important issues in the control of Urge space structures are intimately 
reUted to the fundamental problem of parameter identification. Very often, a 
complicated structure can be adequately modeled for certain operations by the 
fitting of a rather simple model with a number of free parameters. This simple 
model then can be referenced for necessary control operations. Important 
applications include the many space station designs which are based on the assembly 
and Joining of discrete module* by crew memoers. This crew-assisted construction 
will result in a configuration * oich is a Urge-scale composite of many structural 
elements and whose static and dynamic characteristics cannot be adequately 
modeled in advance. In fact, any modeling will require periodic updating as more 
modules are added to the system and as the structural properties of the elements 
slowly change over the lifetime of the station. 

One might also ask how well this identification process can be carried out in the 
presence of noisy data since no sensor system U perfect. With these considerations 
in mind our algorithms are designed to treat both the case of uncertainties in the 
modeling and uncertainties in the data. 

This paper serves as a companion to [6] where the analytical aspects of 
maximum likelihood identification are considered in some detail. Here we focus on 
the questions relevant to the implementation of these schemes, particuUrly as they 
apply to models of Urge space structures. Our emphatU will be *n the influence of 
the infinite -dimensional character of the problem on finite- dimensional 
implementations of the algorithms. We highlight those areas of currer . and future 
analysis which indicate the intezpUy between error analysU and possibU truncations 
of the state and parameter spaces. 

2. MODELS 


As in (6], we consider the systems of the form 
A(©)u(e) » a B(6)<i> + C(©)f 

<j) 

( 2 . 1 ) 

y(0) = H(6)u(6) to^T) 

Here A is a formally self-adjoint elliptic differentUl operator defined over the 
spatUl domain ft; the integral operator <t> is reUted to A by 

A<t> - I , (2.2) 
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where I is the identity. B end C are appropriately dimensional operators that model 
the influence of the process error o and the input f on the state u. H is an 
operator that characterizes the state-to-observation mapi <■> amd n are model 
errors that form the model error vector 

e = (o, T|]t (2. 3) 

and f is a deterministic input. Conceptually the error vector £ represents spatial 
white noise and is characterized by the covariance operator 

E(££*) - I. 

a and cr are non- negative scalar weighting parameters that respectively 

u) T| 

measure the relative importance of the modeling error and the measurement error. 
Thus, the limit cr -* 0 corresponds to the case of perfect modeling while the limit 

-* 0 corresponds to the case of perfect measurements. 

6 is the possibly infinite-dimensional parameter which must be estimated. For 
simplicity we shall generally consider cases where the parametez dependence is 
restricted to the operator A. Furthermore, we assume as in [6] that the parameter 
enters linearly into the expression for the potential energy of the system. Thus we 
assume 


A(0hi * D*(6Du) 


(2.4) 


where D* denotes the formal adjoint of Dj the corresponding potential energy is 
given in terms of the appropriate state-space inner-product: 

<A(6)u,u> » <8Du,Du>. (2.~, 

And finally, the deterministic and stochastic forcings will be localized to 
discrete points which might correspond to actuator locatir's. Similarly, the 
observation map returns a vector of observations at discrete points which might 
correspond to sensor locations. We assumed that there are N point-sensors at 

locations {f ^ and point- actuations at locations {f 


Because of these last assumptions, many of the relevant calculations outlined in 
[6] reduce to matrix and vector manipulations, hi this paper the notation X will 

refer to a finite-dimensional vector whose k-th component is given by g^\ 

Similarly G is the notation for a matrix whose (i,))- component .is given by The 

relevant dimensions of vector and matrix quantities will always be clear from the 
context. 


After taking formal limits in the system (2.1) we have: 

Au * a B^ ♦ cT 

a 

f - Hu + zf 


( 2 . 6 ) 
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where 


- f AN >VT 

o * (tt ,...» o a } 

(2.7a) 

- - Ci) (NAT 

T) s (T1 5 ) 

(2.7b) 

Hu(x) ■ (u(£ u(fj^ ) T 

(2.7c) 

s 


N , 


H ♦T*E6(x-C i )g (i) 

(2.7d) 

i=l 


N 

• 


cF= £ 6(1-^) f (l) 


i=l 

(2.7e) 

C*u(x) » (u(f i ) u(J N )) T 

(2.7f) 

* 


N . 


Bo = l 6 (x-^) o (l) 

(2.7g) 

i=l 


B*u(x) - (u(f y u(f N )) T 

(2.7h) 


a 


*m in the cate of more general sensing and actuating systems, the modeling 
requirements for the system can be reduced to solving equations of the form 

Au * f. (2.8) 


Thus the discussion in this section will focus on how the infinite-dimensional 
structure of the system (2.8) influences the choice of finite-dimensional 
approximations which can be made. In this paper we consider two specific 
structural models: a string under tension and a wrap-rib antenru. 


Let £ be a distance coordinate measured in meters along a string of Length L which 
is also given in meters. Let u(x) be the displacement in meters and let <£(x) the 
tension parameter given in units of newtons. The forcing density is given by f(x) in 
units of newtons/meter. Then the energy potential [3> is yiven by 


A A 

V(u) 


if 


-1 


a(xXu'(x)) a d£ - f(r) u(x) d£. 


(2.9) 


379 



Of course the energy potential is given in units of newton- meters. The equations of 
motion can be derived immediately based on the principles of the calculus of 
variations but it will be convenient to first transform to dimensionless coordinates. 
Let a, be some characteristic value of the tension parameter. We introduce the 

dimensionless variables: 

A 

X 

X = 

L 

a<x) 

a(x) . 


u(x) = 


A A 

u(x) 


L 


aa 


f(x)L 


f(x) = — 

A 

a* 


( 2 . 10 ) 


A A 

V(u) 


V(u) = — 

A 

a*L 

and the potential expression becomes 


V(u) 


•j" f a(x) (u'(x)) a dx - f 

Jo Jo 


f(x)u(x)dx. 


( 2 . 11 ) 


Por simplicity we prescribe boundary conditions corresponding to fixed end points: 

u(o) >u(l)>o (2.12) 

Then arguments based on the calculus of variations give the system 
(a(x)u'(x))' - f(x) 

( 2 . 13 ) 

u(o) » u(l) * 0 0 < x < 1 

This example has been studied many times in the classical literature but an 
analogous approach gives comparable expressions for much more complex systems. 

We consider now a planar model for a wrap-rib antenna which is used to study 
out-of-plane vibrations (see Figure 1). The antenna model comprises N gores 
(subsections) modeled by ‘nterconnected ribs and mesh. Since the transformations 
are similar to those used in the case of the string, we immediately write the 
potential expression with dimensionless coordinates. Let the vector of rib 

displacements be u(r) where the k-th component of vTis the displacement of the 
k-th rib (0 < r < 1). Let the vector of mesh displacements be"?tr,6) where the k-th 
component of Vis the displacement of the k-th mesh sector (0 < r < 1, 0 < 6 < 1). 
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Fig. 1 Simplified model for wrap-rib antenna 

Based on analysis of actual antenna designs, our model consists of N identical 
beams fixed at a central bub. Stretched between the beams are N identical 
anisotropic membranes. The potential equation is given by 



f 1 - * _ fl f 1 - * _ 

- I F D u dr - I I F„ vrdrd© 

Jo R J 0 Jo " 

Here the coefficients {G^} are related to the physical parameters of the beams 
and membranes thusly: 


El 

o 

V — 

oL 

T e L a 
r o 

G = (2.15) 

a 

T L a 

T e L 

V 

o9 

o 
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E and I are respectively the Young's modulus and the moment of inertia of the 
beams. T and Tq are respectively the radial and circumferential tensions of the 
membrane. L is the radius of the antenna and 9 q is the angular width of a sector; 
that is, we have 


2w 

8 = , ( 2 . 16 ) 

o 

N 

where N is the number of gores. Finally, a is some convenient scaling parameter 
with the dimensions of energy (nt-m). We note that the physical forcing densities 

and Fjg having respective dimensions nt/m and nt/m* were rescaled according to 


- L 7T 

^R = _ P R 


F M = “ P M 


( 2 . 17 ) 


Appropriate geometrical boundary conditions follow from fixing the center and 
attaching each of the ribs to its adjoining membranes: 


u| 


dr 


u| 


r=o 


r*o 


( 2 . 18 ) 


7 1 = C v| j u 

9=0 9=1 

Here C is an N x N periodic matrix: 

'0 1 


( 2 . 19 ) 


. 1 
0 
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As in the case of the string, the equations now follow from arguments based on 
the calculus of variations: 




(2.20a) 


1 


r 



(2.20b) 


with the additional natural boi*ndary conditions: 


8*\T 

dr* 


« 0 


r=l 


3*u 

** r=l 


dv* 

”5T 



( 2 . 21 ) 


One of the focuses of this study is the consideration of nonmodal approaches in 
the finite -dimensional approximation schemes. In practice, this generally will mean 
directly solving a linear system of equations rather than proceeding from some 
finite modal synthesis. But the infinite -dimensional structure of the system (2.8) 
also can influence the particular finite-dimensional approximation schemes used. 
Our approach is sufficiently general so that any adequate finite e 1 lent model of 
the system (2.8) should yield adequate numerical approximations. Bv one can often 
do much better for a particular model or a particular class of models. 


We use the antenna model to illustrate the point and make some observations 
that should influence the approximation schemes regardless of which finite element 
or finite difference scheme is employed. We emphasize that these considerations 
also apply to much more complicated antenna models which share salient features 
with the system (2.18) - (2.21). First we note that the structure is periodic in the 
6-direction. This cyclic symmetry leads to considerable savings in the computation 
of solutions to (2.8). This can be deduced from either the differential equation or 
the energy expression (2.14). The periodic matrix C can be diagonalized by means of 
a finite Fourier transform [1]. That is, let U be the N x N matrix whose (j,k) 
component has the form ~ 


0,k) 

U 


exp (i -^j — 0-1) (k-1)) 
J N 


( 2 . 22 ) 
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We then have: 


U*U * I 

<">w esv '"W» 

U*CU = A = diag exp (i (j-1)) (2.23) 

~ ~~ ~ N 

j 

This transformation decouples the system since the potential expression (2.10) 

with 


U*u 


7 = U*v 

IT = U*F 
*R ~ r R 

F., = U*F 

M ~ m 


(2.24) 


has the same form as the original system except that the matrix C is replaced by 
the diagnonal matrix A. The differential system (2.20) is likewise transformed. 
Thus any particular solution of (2.8) can be expressed in terms of N subsystems each 
comprising a single rib counled to a single membrane. Since the cost of solving a 

m - dimensions 1 linear system is 0(m*) this represents a considerable computational 
savings. 


The balancing of terms in the equation also can influence the choice of 
discretization. Based on a report by Lockheed on the specifications for a 55-meter 
wrap-rib antenna with 48 ribs [2], the following nominal parameter ranges were 
derive : 


L ~ 27.5m 

6 ~ 1.31 1(T X 
o 

I ~ 1.31 1(T* m 4 

(2-25) 

B ~ 9.72 10 l ° nt/m* 

T r ~ 1.75 10~ x nt/m 

T 0 ~ 3.50 10' 1 nt/m 

This lives the proper scalings in system (2.20). For simplicity we take 

a - T, L */Q and we have 
G o 
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2.29 


G ~ 

1 

G ~ 8.58 10“’ (2.26) 

2 

G ~ 1 

s 

This nesni that the radial terms of the mesh potential are comparatively small 
except when the radial derivatives are large. How this affects the structure of the 
system is demonstrated in the following example (see Figure 2). 



Fig. 2 Single antenna gore 


Example: 


1 3 / 3v\ 1 

T ~5F \~SFf ~T 


_3fv 

30 a 


v (r,0) 

0*o 

»f (r) 
i 

o < r < 1 
o < 0 < 1 

(2.27) 

v (r,0) 

0*1 

* f (r) 
2 

(f (o)*f (o) * o) 
1 2 


5T (r,6) | 

* 0 




r»l 


hi example we study the equations for a sector of membrane where the 
prescribed boundary conditions depend on the adjoining rib displacements (Mr) and 
Mr)). For simplicity we take the forcing on the mesh to be aero although the more 
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general case coaid be handled in a similar fashion. We are of course interested in 
the case where 


0 < e « 1 


(2.28) 


which corresponds to the parameter ranges (2.26) in (2.20). Physically one expects 
that the radial terms contribute little to the static behavior except perhaps at the 
boundary where the gradients may become large. One is also interested in the 
behavior near the comers (r,0) * (1,0) and (r,6) * (1,1) since some «faignUr behavior 
may be possible. Using the techniques of singular perturbations, (see for example 
[4]), one can show that as e approaches zero we have 

v(r,0) = f (r) (1-0) ♦ f (r)0 
1 2 


♦ e V * Z ( - f Vl> ♦ f ' (l>(-lf ) 

♦ 0(e) 


2 

(nrr ) 2 


exp 



sin (mrO) 

(2.29) 


This expansion could be continued to higher orders, and, as noted before, a more 
complicated expression would result from forcings on the membrane. One possible 
approach to the numerical solution of the system (2.20) would by the elimination of 
the mesh behavior entirely be substituting an expression similar to (2.29) into the 
beam equations (2.20a). Then one would have only equations along the beams to 
solve. If higher-order accuracy on the mesh is required, one could then apply 
finite-element techniques to the system obtained after linearization about the 
asymptotic expansion for the mesh behavior. Finally we note the appearance of 
logarithmic singularities in the mesh gradients (dv/dv, dv/90) as the comers (r,0) = 
(1,0) and (r,9) * (1,1) are approached from the interior of the mesh. This 
consideration should also influence any finite element approximation of the mesh. 

We emphasize that this analysis applies not only to the simplified antenna 
model we have considered but would hold for more elaborate configurations where a 
similar structural balance of terms governs the system. Thus, many modeling 
options can be considered for parameter identification in important classes of 
structures if one does not insist on a traditional modal characterization of the 
system. 

3. THE LIKELIHOOD FUNCTIONAL 

A detailed discussion of the likelihood principle is given in [6]. The functional 
we consider is the negative logarithm of the likelihood ratio associated with the 
detection of a Gaussian signal in additive Gaussian noise; this framework is 
traditional in the theory of communication and signal detection. 

In accordance with the discussion given in [6], and the notation discussed in 
Section 2 ,the log-likelihood functional is given by, 
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(3.1) 


J(6,y) s^Tr (log [o^+ o*R]) 

♦j(y-n5(0))» [o*jU o^R] _1 (y-®©)) 

-i a*/ pj 


where the expected mean and the covariance operator are given by: 


mC6) = H(0) <t>(0) C(0) f 

(3.2) 

R(0) = H(0)d>(0)B(0)B*(0)<b*(0)H*(0) 

Here y^*y^ indicates the Euclidian inner product in the N^-dimentional space to 
which the observators belong. 

From the assumptions of Section 2, it is easy to see that R is an N x N matrix 

<v |5 

whose (i,j) component is given by 


R (i,i) 


k=l 


(3.3) 


8(^)8 <*,!?*> 

k-l 


where g(x|£) is the point-source solution of the underlying elliptic system 

Au 3 6(x - 5) (3.4) 

with the appropriate boundary conditions. And likewise the expected observation 
has the form 


55 = G f 

> 

r. ( f (i) W 


(3.5) 


G (i,,) - g(c i ie ) ) 

where G is an N xN matrix. 

— s a 

We note that (3.1) also differs from (1.3) of [6] in accordance with the 
introduction of the positive weighting parameters, and o^, into the system (2.1). 
An equivalent form for the likelihood functional follows from a rearrangement of 
terms. 
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(3.6) 


J(0,yj = *2 Tr log [I^+ uR] 

♦ (1 /a^ a (y^3(0))* [I^+ vR] -1 (y^m(0) 
- (1/cM 2 y*y 


where 


V - 



(3.7) 


This form is useful since one can arrive at the functional given in [6] directly by 
the substitutions 

A 

R ■* ii R = R 

Aw 

y -» (1 ~ y (3.8) 

in -* U/a ) m * m 
ti 


This correspondence allows one to use the algorithms derived in [6] directly on the 
functional 


J (0,y) =4 Tr Log l]U R] 

+4 (y - *b( 0))* [i + £] -1 (y - m) 


1 a A 

- T y*y 


(3.9) 


The goal is to find the parameter value 0 which minimizes the log-likelihood 
functional; that is, we wish to solve 

min 1(0, y) , (3.10) 

0 


where 0 ranges over some appropriate infinite-dimensional space. Assuming that 
the functional hr.s a Prechet derivative and satisfies an appropriate convexity 
condition, one can restate the problem (3.10) as 

81/30 (0,y) « 0 (3.11) 

Both problems (3.10) and (3.11) have been studied in a variety of contexts (See, for 
example, [5]). 

Since both the parameter space and the state space are infinite-dimensional, 
one must make dual approximations in order to achieve problems that are 
finite -dimensional and therefore computationally tractable. Thus in practice one 
solves a sequence of problems of the form 
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(3.12) 


AAA 

min J(0,y) 


£ 


or 

af/ae (£,y) - o 


(3.13) 


where the state-space and parameter- space have been replaced by finite 
dimensional spaces. Then the problem reduces to a finite minimi sation problem 
which can be treated numerically by a variety of techniques (see, for example, [1,5]). 


The state-space can be approximated by a finite -element space which is 
appropriate for approximating solutions to (3.4), and the parameter-space can be 
conveniently represented by a spline-based space. Let be the dimension of the 

finite-dimensional approximation to the state-space and let Nq be the dimensions of 
the finite-dimensional parameter space. This leads to the natural substitutions 


N 


u -» £ u 
k=l 


(k) 


♦k ( *> 


N e 

e -» i e (k> <c k (6) 
k»l 


0 a [6 


,(i) 




(3.14) 


U a [U 


(1) 




> 


where the sets and {r^} give the basis elements for the state and parameter 
spaces respectively. 

It will also be convenient to consider the state-space inner-product with a 
weighting given by the basis elements of the parameter space. Thus we define 


< U,V = <U, lfj(0)V> 
0 • {!»...» Nq}) 


(3.15) 


In the following we restrict our attention to these finite-dimensional problems, 
and, when the context is clear, we suppress the ''-notation. Questions concerning 
the convergence of the numerical schemes and the general relationship between the 
infinite -dimensional and finite -dimensional problems will be discussed more fully in 
a future report. 
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Most nonlinear optimization techniques require solving linearized systems 
iteratively, and consequently one must solve systems of the form (3.4), where the 
dimension may be quite large. Since the complexity of solving an m-dimensional 

linear system is 0(m*), the speed of convergence of the iterates is an important 
consideration. With this in mind, we emphasize the use of quasi-Newton methods 
for the solution of (3.13). Consequently much of the resulting effort is directed 
towards deriving adequate approximations for the Nq- dimensional Jacobian vector 

3J/30 and the Nq x Nq Hessian matrix 3 a J/30 2 . 

We briefly outline the procedure here; as noted previously, a more complete 
description is given in [6]. In general for the finite problems, the dimension of the 
state space (N^) is much larger than the dimension of the parameter space (Nq), the 

number of sensors (N ) or the number of actuators (N ), and so it is preferable to 

S A 

carry out the necessary manipulations in spaces whose dimensions do not depend on 
the dimension of the state space. 


Therefore, as in [6] we represent calculations in terms of the eigen-structure of 

the N x N matrix R. 
s s ~ 



= X 



X k ‘ M a k <° 5 a r < 7 ) 


(3.16) 


From the spectral components of R we define useful quantities as given in [6]. 

From (3.2) we have the expected observation 

m = Hd>c7 

(3.17) 

“k * V 5 


and also we define the filtered observation 


"z" = Ly* + (I - L) m 

~ ~ ~ (3.18) 

\ ■ V r 

where the N x N matrix L is given by 

S3 ^ 

_y. 

L = I - (I + R) 

~ ~ ~ ~ (3.19) 

= T. (1 - COSG^) 

k 

and the related matrix K is given by 
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(3.20) 


K - a ♦ R) Vl - I 

r>J fKj ^ 

• £ (seca^ - 1) <? k ^ k * 
k 


For algebraic convenience we also define the residual of the process: 
"e = y-7 

e k 3 V r 

The gradient of L is represented by 

3L/ae (,> - z E.’ n AV 

k m#k 


where the coefficients {aj^} are given by: 


- (sin a^)* <Dp k> Dj^ >. 


, k « m 


‘km 


((X^ X k ) / (\£ - \^)) (cosc^ - cosa m ) • 

[x k <D »m' ^ V X m <Dp k' > )> 


, k 4 m 

J 


For later convenience we derive another form for the coefficients {a^ |y }. 
standard trigonometric identities one can easily verify the relation 


cosa. - cosa (cosa .) a (cosa ) a 

* m k m 

(tana. ) a - (tana ) a cosa + cosa. 

km tn k 

This leads to an alternate form for the coefficients 

- ( coso^* t(X k >* <Dp fc , > } ] , k-m 

- (cosa k cosa m > a / (coso^ + cosa m > . 

IX m V J <Dp m’ ^ V H K m <D *V ^m >f ’ k * m 



(3.21) 


(3.22) 


(3.23) 


Using 


(3.24) 


(3.2S) 
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The point of this last derivation is that the bracketed terms reduce to simpler 
expressions. From (2.2), (2.7), and (3.4) one can easily show: 


■ V ♦* H * ^ 

N 

= x’ 1 z g (x|f ^ 

j*l 


and also from (2.7) we have: 


x 


m 


B*p 


m 



k=l i=l , 


(3.26) 


(3.27) 


By this we have: 

^m^k'^mV^k^m 


where the N x N matrix A. has the form 
s s 

N a 

Aj k,m) *Z g <Dg (x|f k ), Dg (x|C t ) >j 

i*l 


(3.28) 


(3.29) 


And similarly we have the useful relation 


x k <D?k, ray Vl| f 


(3.3b) 


u * d>cf 

where the N x N matrix B. is given by 

8 A 

B (k,m) = <Dg (x(Ck)i Dg (x,r m »j 

We now give expressions for the gradient and the Hessian in terms 
quantities given above. As in [6] the gradient can be represented as 

aj/ae (i) = -z sin* un <Dr k . 

-Ee k Oz k /30 (i) ) 


(3.31) 
of the 


(3.32) 


Here the spectral coefficients 32^/30 


0 > 


are given by 
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(3.33) 


a V ae< ” * 1 CMa m * L e k 

m 

- (cosa^) CDpj,, DS>jJ 

Exact expressions for the Hessian are given in [6]s in general, however, all 
terms need not be estimated to give an adequate approximation. In particular, the 
calculations are much simpler if the terms with second-order derivatives can be 
ignored. The simplest approximation comes from only keeping those terms which 
contribute to the expected vt of the Hessian. Thus, from (1.12) of [6] the 
(i,j) - component of the N x N Hessian approximation M is given by 

8 S < “ w 


* tr i3L/ae (1) (i + K) 3L/ae (J) a ♦ K) i 


(3z?30 (i) ) * (3z736 (j) ) 

(3.34) 

* tr tXi Xj] ♦ E eyae (i) dz k /3e (}) 


where by (3.20) and (3.22) we have vj m,k> - aj^seco^. 



This estimate is justified when the covariance is small as one might expect if the 
number of measurements is large. This point will be investigated more rigorously ir 
a future paper. 


We now summarize the search procedure for the system (2.6) where the 
N s -dimensional observation vector y" is given and an initial N q -dimensional 
parameter estimate 9 q is available. 

First the expected observation m and the covariance matrix R are determined from 
(3.3) and (3.S). The spectral decomposition of R as well as the quantities given by (b.17) 
- (3.33) then can be determined by standard "matrix algebra routines. And therefore 

from (3.32), (3.33) and (3.34) one obtains an N -dimensional gradient approximation g 

z 

and an N x N Her-ian approximation M. 

The parameter estimate 6 q can then be updated by making the quasi-Newton 
correction: 

6 t »9 - y M _1 g" (3.35) 

* o o ~ 

Here y q is an appropriate scalar chosen to improve the updated parameter 

estimate. In accordance w? i the geuera) theory of Ne'-rton iterations in function 
spaces [5], one can repeat this procedure until the solutions of the linearized problems 
converge to the solution of the underlying nonlinear problem. 

This analysis completes our outline of the maximum likelihood identification 
process. In Section 4 we give examples which illustrate the successful 
implementation of these schemes in useful applications. 
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•>. EXAMPLES: In this section we give examples of successful implementations of the 
previously discussed algorithms. 


We first consider the string (cf (2.13) 
Au = - (a(x) u'(x))' 


u(o) = u(l) = 0 


(4.1) 


For the case where the unknown tension parameter is constant, the point-source 
solution can be explicitly given: 

u(x|£) = * (1 - *>)(*<•> 

x^^ = max {x,f } (4.2) 

x < = min {x,f } 

And thus, as outlined in Section 2, all calculations could be given in terms of these 
quantities, without any truncation of the state space or the parameter space. 

In general, however, truncations in both spaces are necessary. For the string 
problem we consider an - dimensional state space of linear splines; the state 

variable then beccmes the vector of nodal values on the corresponding grid. For 
simplicity we take the grid to be uniform; thus, since the endpoints x = 0 and x = 1 
are fixed, we have: 


1 

Ax = 

N + 1 
x 

The state-space elements are then given by 
N 

r x (i) , , 

U = E U IC.(x) 
i=l 


where, as illustrated in Figure 3, the basis elements (r^x)} have the form 


! x -(i-l)Ax 
Ax 

x -(i)Ax 
Ax 


, (i-1) Ax < x < (i) Ax 


, (i) Ax < x < (i+1) Ax 


(4.3) 


(4.4) 


(4.5) 


0 


otherwise 




Pig. 3 Linear spline elements 

A similar discretization of the parameter space is possible. First we consider the 
augmented spline space 

N +1 

{r^xW W.6) 

1 UO 

where, as illustrated by Figure 3, the endpoint- elements r Q and + j are given by 

x 
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, 0 < x < Ax 


(Ax) - x 
Ax 

0 , A x < x < 1 

(4.7) 

0 , 0< x < (Ax) 

x - N x (Ax) 

, N (Ax) < x < 1 

x 

Ax 

Thus we have a corresponding parameter element 

N x +1 

a(x) = E a (l+1> r^x) (4.8) 

Uo 

which would give a parameter space {51 with dimension N^+2. 

However, as previously noted, the resolution of the parameter space often does 
not need to he as fine as the resolution of the state- space. We consider then the use 
of a piecewise linear parameter space of lower dimension where the only 
requirement is that the nodal points must be a subset of the nodal points of the 
state-space. The new parameter space is then a subset of the 2)- dimensional 

space given by (4.6). Let a be an Nq -dimensional parameter element (Nq S N^+2). 
Then a identifies with an element <T of the larger (N^ 2)- dimensional space and the 
relationship is given by 

a - £a (4.9) 

where B is an (N -t 2) x N Q matrix. And correspondingly, we have 

^ X V 

3a/3a = B . (4.10) 

This relationship simplifies the algorithms as described below since B is easy to 
construct, and the more cumbersome calculations which are needed to determine 
partial derivatives with respect to the parameter space are then specified in terms 
of the grid associated with the state space. Thus we have: 

a/aa-^B d/aa (4 . id 
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We illustrate these points with a sample calculation (see Fig. 4). We consider 
the case where there are seven sensors at the locations 

£ e {.125, .25, .375, .5, .625, .75, .35} (4.12) 


AAA 
S S S S S S S 

• 1 I 1 1 I I 1 . . « 

0.0 0.25 0.5 0.75 1 


S: SENSOR LOCATION 
A: ACTUATOR LOCATION 


rig. 4 String tension identification: sensor and actuator locations 
and three actuators at the locations 

£ e {.25, .5, .75}. (4.13) 

The data vector was derived from a plant with specifications 


plant 


(x) = 3 + x 


C f = 6(x- .25) +6(x-.5) + 6(x-.75) 


a = .001 
u 


(4.14) 


o = .001 
h 

For the state space we take the seven-dimensional space of linear splines {r^(x)} 

with nodes corresponding to the sensor locations (4.12), and for the parameter space 
we take the five -dimensional subset of linear splines with nodes corresponding to 
the set 


{0, .25, .5, .75, 1.} (4.15) 

The relaxation parameter y Q in (3.35) was chosen to speed up to the convergence of 

the iteration; these issues will be discussed more fully in a future report but we give 
the results of the calculations in Fig. 5. These numerical experiments appear to be 
very encouraging although with a crude approximation to the Hessian the 
convergence can be very slow. 
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ITERATE 

a* 11 

a‘ 2 » 

mm 

a< 4 > 

a ,5 > 

A 

INITIAL 

1.00 

a 

m 

1.00 

n 

B 

INTERMEDIATE 

3.19 

3.64 

1.98 

3.81 

3.62 

c 

FINAL 

2.98 

3.29 

3.55 

3.68 

4.05 

D 

TRUE 

3.00 

3.25 

3.50 

3.75 

Hi 


Fig. 5 Distributed parameter identification via spline analysis 


In a similar way, the algorithm was successfully applied to the wrap- rib antenna 
model (2.4). To simplify the calculations, we assumed here that the stiffness 
parameters were scalers although one could introduce a spline-based space as in the 
previous example. 

Again for simplicity we consider the case where there are six gores (N * 6), 
where a sensor is placed on the outer endpoint of each rib (r - 1), and where an 
actuator is placed at the midpoint of each rib (r *.5). This scheme is outlined in 
Figure 6. 


We introduce the set of N-dimensional unit vectors 



where the components of each are determined by 


(4.16) 
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• POINT - SENSOR LOCATION 

* POINT - ACTUATOR LOCATION 


Pig. 6 Rib stiffness identification: sensor and actuator locations 


1 , i = k 

0 , i*k 

The parameters of the likelihood functional are then given by 

a * .001 

a 

a = .001 
T1 

CT» £ «^6(r - .5) 

And the stiffness parameters of the pla^. are given by 
El = 1.25 .10* nt-m a 
T r = 1 ''S .10* 1 nt/m 
Tq 3 3.5 .10 nt/m 
L = 2.75 .10 -1 m 



(4.17) 

(4.18) 

(4.19a) 

(4.19b) 

(4.19c) 

(4.19d) 


We applied the algorithm then to the case where the unknown parameter was El 
while the other stiffness parameters were assumed to be known. 
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To discretize the suite -space eight equal subdivisions were made in the radial 
direction on each rib and in each mesh sector, in the circumferential direction five 
equal subdivisions were made in each mesh sector. The shape functions on the ribs 
were given by Hermite cubics while on the mesh the shape functions were given by 
splines linear in r and 9. In test cases this discretization produced at least three 
digits of accuracy in solving problems of the form (3.4). In all calculations the 
principle of cyclic symmetry (cf. (2.24)) was exploited to reduce the number of 
calculations. 

Convergence of the likelihood algorithm was very fast (see Pig. 7) when the 
relaxation parameter was taken to be 

y = 2.5 . 

'o 



0.100 0.125 0.160 0.175 

RIB STIFFNESS X 10* 6 ni-m 2 


ITERATION 
H\ -iBFR 
N 

ESTIMATED 

PARAMETER 

_ E, N 

RELATIVE ERROR. 

|(ei n -ei est )/eie S t)| 

1 

1.75000* 10 6 

4. • 10* T 

2 

1.01930* 10 5 

2.-10' 1 

3 

1.20484* 10 5 

3. • 10^ 2 

4 

1.23866* 10 5 

2.*10" 4 

5 

1.23889* 10 5 

- 


El pLANT « 1.26000* 10 5 
e, est ‘ 123889 * 1q5 


Pig. 7 Distributed parameter identification of beam stiffness parameter 
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Numerical experiments also demonstrated an improvement in the sensitivity of the 
identification schemes as the number of measurements was increased. Thus, Figure 
8 illustrates how, for the antenna problem considered, an increase in the number of 
sensors led to a steepening of the likelihood functional. Here the curves were 
shifted transversely for illustrative purposes. We note that no corresponding 
improvement in the parameter estimate occurred in these trials, possibly because of 
the less favorable signal- to- noise ratio which corresponds to sensing in the interior 
of the ribs. 


LIKELIHOOD FUNCTIONAL 



Fig. 8 Sensitivity of antenna stiffness - parameter identifica- 
tion according to number of aensors 

(a) One sensor per rib at r - 1.0 

(b) Two sensors per rib at r • 0.5, 1.0 

(c) Three sensors per rib at r - 0.5, 0.75, 1.0 

More detailed numerical experiments with distributed antenna stiffness 
parameters will be given in a future report. But the results outlined in this report 
demonstrate already the great potential for these algorithms. 
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DISCUSSION— FUTURE RESEARCH DIRECTIONS 

Moderator: Kerman A. Rediess, H. R. Textron, Inc. 

Panel Members: A. V. Balakrishnan, University of California, Los Angeles, 

R. F. Carlisle, NASA Headquarters, J. L. Lions, College de France 
and Institut National de Recherche en Informatique et en Automatique, 

R. E. Skelton, Purdue University, and W. E. Vander Velde, 
Massachusetts Institute of Technology 


SYNOPSIS 

The intent in this session was to present several viewpoints on the direction of 
future research in identification and control of flexible space structures. The panel 
members were given the option of preparing formal papers or presenting informal 
comments. Professor Lions was unable to attend the meeting because of a last minute 
change in plans but did submit a formal written paper which is printed in Session VII of 
these proceedings. This synopsis attempts to capture the main points discussed but has 
not been reviewed or endorsed by the speakers. 

Professor Vander Velde discussed uncertainty management methodology for large 
space structures. He identified six major types of uncertainties that are likely to exist 
in control system design and operation. After assessing the present methodology for 
handling each type, he suggested the additional technology developments needed. 
Altogether, he suggested specific research topics covering: sensor noise ir nonlinear 
situations; external or internal disturbances; modeling approximations; model parameter 
errors; component operational status and reliabUity; and, spacecraft configuration 
changes. He introduced, and discussed to some extent, the concept of an on-line 
system diagnostic monitor for detecting certain types of faults or uncertainties, such as 
model parameter errors, sensor failures or actuator failures. 

Professor Vander Velde concluded that the dynamic character of flexible space 
structures and the likely complexity of their control systems make them especially 
likely to have uncertain component operating status. The problem of detecting 
component failures is coupled with the effects of modeling error whether we like it or 
not. Perhaps this coupling can be utilized, or at least controlled, in the process of 
monitoring the health of the system by use of an integrated system diagnostic monitor. 

Professor Balakrishnan discussed the important research area of uncertainty 
modeling from a perspective of estimation and identification problems in random 
fields. Current estimation and systems identification methods only consider noise with 
rational spectra. We need to look at signals with non-rational spectra. One area in 
which non-rational spectra appear naturally is in the area of random fields. Random 
fields are random processes in which the parameter is no longer time and arise on space 
structures in describing, for example, deformation in an antenna or mirror. Professor 
Balakrishnan used a geophysical example of gravity anomalies to illustrate the 
effectiveness of random field theory in estimation problems when dealing with 
non-rational spectra. Although there exists substantial research on the theory, there 
are only a few practical applications in the literature. Techniques are available that 
can be applied to practical large space structures problems. He suggested that more 
effort be made to apply these techniques. 
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Professor Skelton presented his views on the inseparability of the control and 
modeling problems. When a model is used to derive a control law, then . ne cannot say 
how good the model is witho\;t knowing what it will be used for and what the control 
will be. That means that one cannot evaluate the impact of modeling errors until the 
control is specified. There is a challenge to the community to develop techniques that 
do the complete model error estimation and compensation problem. 

Mr. Carlisle of NASA Headquarters discussed the challenges in controls for the 
Space Station. He views the problem of designing the Space Station as involving trades 
among "subsystems". For example, in trades between controls and structures, one uses 
all of the technology options in both disciplines to optimize the overall design. NASA's 
plan is for an evolutionary Space Station with the initial operating configuration to be 
placed in orbit in 1992. Block changes in configuration and maybe in performance will 
be made subsequently. There is a challenge to the controls community to develop the 
technology that will provide the flexibility for changes and not be the limiting element 
in future expansion of the Space Station. There is a five year period to mature 
technology for the initial system. Periodic block changes will provide opportunities for 
some later technology to be considered. 

Although the Space Station configuration has not yet been determined, it appears 
likely that it will involve large flexible structures because of the solar arrays, radiators, 
or even the b.tsic structure as it evolves and grows over many years. Mr. Carlisle 
described several possible configurations and pointed out the controls challenges. One 
of the major challenges identified was to devise an affordable scheme for developing 
and validating the technology for control of flexible structures in view of the high cost 
of full scale flight experiments. 

Dr. Rediess presented his thoughts on the role of experim'nts in the development 
of control technology for flexible space structures. The three main points covered 
were: why experiments are important for developing the technology; the need for a 
coordinated program from analysis to experiments; and the need for a cooperative 
program among the various participants to make effective use of the relatively rare 
experimental data. It will become increasingly more difficult if not impossible to 
perform ground vibration testing on large flexible space structures. Designers will have 
to rely more on analytical techniques to predict control/structure interactions. At the 
same time, control/structure interactions are becoming more important because of 
increased overlapping of the required controller bandwidth and the structural modes. 
Damping characteristics, important in the controller design, are probably the most 
difficult to predict. Even the properties of new materials, such as metal matrix 
composites, can influence the control/structures design optimization. His conclusion 
was that experiments on the ground and on-orbit will be necessary to develop and 
validate the analysis and design techniques. A well coordinated program is needed to 
bring together the promising analytical methods and the experiments 

Dr. Rediess' final point and recommendation was that NASA, the Air Force, and 
other organizations conducting and/or sponsoring major experiments either in 
laboratories or on-orbit should establish a cooperative data base with broad access to 
the technical community. Such experiments are costly and often difficult to get 
supported. It is imperative that the most effective use be made of these very limited 
experimental data. 
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Before opening the discussion to the general audience, Dr. Rediess invited Dr. R. V. 
Ramnath from Raytheon and an Adjunct Professor of Aeronautics and Astronautics at 
MIT to present some remarks on the application of asymptotic techniques to control of 
flexible space structures (to expand on some of the ideas in Professor Lions' paper). 

Asymptotic analysis is defined as a study of applied mathematical systems in 
limiting cases. The following are some of the benefits of using it. In model reduction 
techniques, an asymptotic ordering of a complicated system into a hierarchical 
structure can be used in selecting the order of the model. The principle of minimal 
simplication gives a stopping rule for this process. In solving a complex linear 
differential equation, the domain of the independent parameter can be extended into a 
higher dimension. An ordinary differential equation is thus converted into a partial 
differential equation and solved asymptotically in the new space. The solution is made 
to coincide with the solution of the original system along certain trajectories. 
Asymptotic analysis yields a quasianalytic solution which can give insight into stability 
and controllability of the system. Approximation errors are generally smaller. Dr. 
Ramnath gave two examples where asymptotic analysis was successfully used to 
generate the controls. He stressed the need of more applications in control problems of 
large flexible space structures. These techniques have been used very effectively in 
other fields such as celestial mechanics and fluid dynamics, among others. 

The following are what appeared to be the most important comments and 
recommendations made during the open discussion period. 

• Some people have suggested that Skylab controls technology will be adequate 
for Space Station. If so, there is no need for controls research to support the 
Space Station. 

• If we rely on Skylab controls technology, we may seriously limit the 
evolutionary growth of the Space Station. 

• It was mentioned earlier in the workshop that NASA cannot afford full scale 
on-orbit flight experiments. What evidence do we have to say that we can 
afford not to conduct full scale flight experiments? 

• If NASA cannot afford full scale flight experiments, then the project 
managers will not use the advanced controls technology and will take a more 
co.iservative approach. 

• There seems to be a change in emphasis in NASA’s program away from 
fundamental research on controls and towards large experimental programs. 
If that is true, we may be in danger of cutting off new innovative concepts. 

• Prom the perspective of a user of advanced technology, it is necessary to 
periodically take advanced theory and apply it to practical problems in order 
to develop an adequate confidence for application to real systems. Without 
that confidence, the theory would never be used, and we would be reluctant 
to support more fundamental work. There needs to be a balance of 
theoretical and applied/experimental effort in NASA's program. 
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• NASA is to be complimented on the excellent program pirn for control of 
flexible structures that was presented by Mr. Russell at this workshop. It 
presents a balanced program and a truly integrated controls and structures 
technology plan. 

EDITOR'S NOTE: There follows a synopsis of the prepared remarks given by the 

panel members. 

H. A. REDIESS: I have a particular personal interest in this workshop. About three 
years ago, when 1 was at NAS/ Headquarters and responsible for this technical area of 
NASA's R&D program, I encouraged G. Rodrijuez of JPL and L. W. Taylor, Jr. of 
Langley Research Center to organize a workshop on control of flexible space 
structures. The first one was held about two years ago at JPL and was very effective. 
This, the second one, has been equally as successful, and I am pleased to chair this final 
plenary session. One objective of this workshop is to provide some feedback to NASA 
and JPL on research opportunities in controls technology for flexible space structures. 
We have invited several noted technologists to lead this discussion with you, the 
workshop participants, to identify these research opportunities. After the prepared 
remarks by the panelists, the audience will be invited to join in the discussion. 
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W. E. V ANDES VBLDE. Uncertainty Management Methodology for Large Space 
Structures 1 think it is safe to cay that the majority of the papers we have listened to 
in the last couple of days have dealt with one aspect or another handling 
uncertainties in control systems, because, after all, it is one of the f Jidam-ntal 
purposes of a feedback control system to accommodate disturbances, noise, and plant 
uncertainties in certain bandwidth ranges. That is really fundamental in what we are 
doing. Our purpose here, in this pr mentation this morning, is to summarize the nature 
of the uncertainties that we need to deal with, to summarize the technologies that we 
have available to us to deal with them, and to identify where there are gaps. 

Now, suppose you were working onboard a space station, and you gradually became 
vaguely aware that there was some persistent oscillation going on. In fact, it was 
getting bigger and bigger, and very soon it was clear to you that you had a problem. 
The question is, what would you do? Maybe more to the point, would you have designed 
your control system in such t w'v that it might have monitored its own behavior and 
taken some action when it observed that it had a problem? If so. what would that 
action be7 Well, this is one evidence of uncertainty in action, because, of cours ;, based 
upon our understanding of the dynamics of the system and environment, the system 
should not behave in this fashion. It is only because we are uncertain about something 
that we might conceivably get this kind of behavior. But here we are oscillating along 
in the space station. What are we going to do next? We need to have some fall-back 
position which is as robust and reliable as possible. It is certainly .rue that the most 
robust system you can build is one that does not do anything. Although, in the context 
of spacecraft control, it may very well be important to at least keep one antenna 
pointed toward earth to maintain wideband communications. Certainly, Mte loop that 
accomplishes that purpose should be extremely low bandwith. It should not. attempt to 
control any flexible mode actively. It should just try to do *h*it ouj auction. Well, 
suppose we have indeed disengaged our unstable spac* .em, or at leia gone 

back to a truly robust feedback system. That would tn« .. ~s • little breathing 
room, and we would have time to sit back and contemplate what ujght hive gone 
wrong. What are some of the uncertainties? What are the tools that we need in order 
to handle these different types of uncertainties? What are some of the research 
directions that we ought to recommend? I am going to address these question' in my 
discussion. The list of uncertainties in control system design and operation that I will 
discuss includes: sensor noise, noise in other components; external or internal 

disturbances; modeling approximations; model parameter errors; component operational 
status; and configuration changes. 

Sensor noise is a source of uncertainty that has been addressed for a long time. Aa 
you well know, if you have a fully linear situation, linear filtering in estimation theory 
is complete. Even for slightly nonlinear measurements or slightly nonlinear dynamics, 
approximate extensions of the linear theory, such as various forms of extended Kalman 
filtering, work well. It is true that if you have a significantly nonlinear problem, a 
nonlinear set of measurements or dynamics, there is a body of theory which is 
applicable. But this is very cumbersome to implement. If you cannot assume from the 
start that you have Gaussian distributions, you have to, in effect, estimate the entire 
probability density function for the state, given the measurements, and that is 
inherently an infinite dimensional problem. In the context of spacecraft control, I do 
not think that the nonlinear situation will be important, unless there are certain types 
of sensors with nonlinear properties. The only additional technology required are some 
specialized techniques for handling state estimation for particular types of nonlinear 
situations. I do not think that it is fruitful to pursue general nonlinear estimation as 
much as specific techniques that might be applicable to specific problems. 
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The second category of uncertainties I will discuss includes noise in other 
components of the system . This is a very important source of noise. There are many 
instances in which autopilots pick up electronic noise that results in noiry actuator 
operation. Analytically, this appears as an internal system disturbance. I have included 
that in my next category, which is internal and external disturbances. Here again, if 
the system is fully linear, and if these disturbances can be modeled as random processes 
with rational spectra, then linear system theory is complete. I mention rational 
spectra, because, if one were to design filters in the frequency domain, one would 
probably use spectral factorization. This is only easy to do for rational spectra. In any 
case, for a significantly nonlinear system, the situation is just as it is for sensor noise. 
Again, I would say it would be more fruitful to pursue specialized analysis te uiques 
that would apply to particular situat : ons. 

I have mentioned one other topic under additional technology requirements: 
internal disturbances that are generated in one part of a spacecraft and propagated to 
other parts. In fact, just yesterday, we heard one presentation in which it was 
emphasized that, for multipurpose spacecraft, very often the most significant 
disturbances are those that one payload imparts to the other payloads. For example, 
you might have a scanner, with an oscillating mirror or something similar, which is 
creating a disturbance to other parts of the spacecraft. So, why not try to isolate tho .5 
disturbances within the modules where they originate. I am sure that this is not a new 
idea. I list it here and emphasize the fact that active disturbance isolation may make it 
possible to do this even better then purely passive structure design can do. 

This next category is one that has received a great deal of attention in this 
workshop. This is warrected because we are assured in advance that, in dealing with 
flexible spacecraft modeling uncertainties, we are going to be working with an 
inaccurate model of the true dynamics. There a number of methodologies that we are 
all familiar with to deal with the situation. One such method involves designing a 
control system on tne basis of a reduced-order model and then evaluating it with a 
higher fidelity model. We know several approaches to the problem of model order 
reduction. Absolute stability theory is a way of dealing with stability of systems with a 
nonlinear operator confined only to one sector. And finally, there is a very useful 
approach using singular values for evaluation of robustness properties of multi-variable 
systems. There are several additional techniques in modeling approximations that 
would be very useful. The first one has to do with small but arbitrary damping 
characteristics cf space svstems. The characteristics of flexible structures which will 
be least understood in advance will be their damping properties. But, if the damping is 
small, you have the impression that you should be able t.o design a control system which 
does not really depend on what is the specific damping. I have not attempted to do 
this. It just seems to me that it might be possible. In a similar spirit, slight 
nonlinearities and uncertain stiffness properties in higher order systems may not be 
important under steady regulation. They might be important, for example, under slew 
maneuver.. The idea of design procedures for control systems, in which modeling 
inaccuracies of an unspecified form are incorporated directly, is a highly desirable 
objective but a difficult one to be sure. 

Model > arameter errors are a somewhat more structured kind of uncertainty in the 
plant. Our conception is that w^ have a model which is adequate in some sense. We 
just do not know quite what the values of the parameters are. The standard approaches 
available to us include passively robust controllers and adaptive systems. There are two 
basic types of adaptive systems: in one you explicitly identify parameters; in the other 
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you do not. We have already heard some interesting things about adaptive systems at 
this workshop. Under the heading of robust controllers, it is always my impression that 
robustness normally is bought at the cost of some aspect of performance. It would be 
well to work on that trade-off and attempt to improve it. Under parameter 
identification, the issue of how to efficiently model a system is important from the 
standpoint of the original design, as well as from the standpoint of identifying the 
system. One of the thoughts in an earlier paper at this workshop appeals very much to 
me. The idea is to model certain modules of the system as distributed parameter 
models and then piece them together at the boundaries. It strikes me that this might be 
an efficient way to handle some configurations at least. The idea of optimizing inputs 
for identification is not a new thought by any means. It is true that, in order to identify 
all of the properties of the system, you have to push on the system with your own 
actuators and measure the effect on the output variables. You would like to do it in a 
way that affords you the best identification. In the context of flexible spacecraft, that 
has to be done carefully. I mention the problem of adjusting the controller following 
identification of the plant simply because of the fact that the design of the controller 
in the first place is such a difficult chore in the large space structure applications. If 
we indeed identify the plant, and find it to be slightly different than the model we have 
in the original design process, then the controller has to be adjusted to account for the 
differences. That may not be a trivial thing to do. In our present state of knowledge 
about the design of these systems, control design seems to be very tricky. The designs 
have to be tuned Just about right in order to perform well. 

Finally, under adaptive control we actually heard some interesting results on 
stability theory yesterday. It Is still true that it is somewhat of a risky business. The 
conditions for stability generally depend on the relative magnitude of things like 
disturbances relative to known inputs. It would be very well if we could use and depend 
upon adaptive controllers. Any research that can be done to clarify stability properties 
of adaptive controllers would be helpful. 

Component operational status has to do with the failure status of the components 
of the system. We do have a number of methods for failure detection and isolation 
(FD!\ Less work has been done on the problem of reconfiguring the system following 
the detection of a failure. In the area of computers and signal transmission networks, 
there is q’lite a substantial background of work. Under additional technology, I mention 
the fact that FDI system concepts are needed that are less sensitive to unmodeled 
dynamics. It is true that, if your FDI depends upon system dynamics modeling, then it 
suffers from unmodeled dynamics, Just like the controller design problem suffers from 
unmodeled dynamics - and maybe in an even more sensitive manner. Methods for 
reliable reconfiguration are certainly not settled at this point. In effect, we have to do 
• redesign of the system using one less component once we discover a failure. The 
redesign process is not all that simple because the design process is difficult in this 
application. 

i mentioned fault tolerant assembly of loosely coupled computers only to suggest 
that the fault tolerant computer assemblies that we are dealing with now, for the most 
part, are based upon varying degrees of synchronism - very tight synchronism in some 
cases and looser in other cases. In a large spacecraft - particularly one tnat has been 
assembled in space out of a number of modules, with each module controlled separately 
prior to its assembly with the rest - you will have control systems with asynchronous 
computers. You would like to be able to monitor those for their failure performance as 
well. And finally, a very important item is the validation of the operational software 
which is executed in the system. 
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The last category of uncertainty >. onfixuration changes . This is very important 
for the space station, which is to be a .owing system. It does seem to me that we will 
need to use pre-programmed changes in the controllers to coincide with configuration 
changes. I suspect that we will use a low bandwidth robust controller during the period 
of a change. Someone observed yesterday that it would be nice to avoid the need for 
that by using adaptive controllers that were adequate to handle configuration changes. 
You certainly would have to suppress transients in the control system when a change in 
a controller is made, but I think that is not hard to do. Another problem will be the 
isolation of the disturbance due to mating a new module with the existing assembly. 
One should isolate the effect of that to the local region, if possible, to prevent the 
disturbance from propagating throughout the structure. 

These are some of the different types of uncertainties and the technology that we 
have to deal with. But, what about the unstable spacecraft I mentioned at the 
beginning? We still have it in the backup mode and have not yet figured out what is 
wrong. Do the methodologies that we talked about answer this question? Are they 
adequate to help us discover what happened? Well, I would say not really. Nothing of 
what we spoke of is directly applicable to the diagnosis of a problem like this. So, it 
would be very nice to have an additional tool, which I am calling here a System 
Diagnostic Monitor. A System Diagnostic Monitor has the property that it monitors the 
health of the system. Not only does it monitor the health and give a go/no-go 
indication. When it is no-go, it isolates the failure. This is helpful in figuring out what 
has happened. In particular, you might be able to monitor parameter values in the 
model to detect changes as well as monitoring for failures in components. A desirable 
property of such a monitor would be that it can be reconfigured from time to time so 
that it can monitor different characteristics of the system. A suggested possible 
structure for such a system monitor is shown in Figure 1. The controller can either be 
in operation or not. It does not make any difference as far as the monitor is 
concerned. The lower part of the figure shows a failure detection filter which has the 
same structure as a Kalman filter or any other linear filter. In fact, if the gain matrix 
D which operates on the measurement residual were chosen to be the Kalman gain 



Figure 1. A Possible Structure for the System Diagnostic Monitor 
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matrix, this would bo t Kalman filter. But it is possible to design the gain matrix in 
different ways. In particular, it is possible to design that gain matrix such that, for 
certain selected events, the residual can be constrained at the output to lie in a fixed 
direction. This idea was first proposed for the purpose of doing component failure 
detection, but it can also be used to detect individual parameters of a .nodel that have 
been mismatched I will just give you a quick illustration of this in Figure 2. 


STOTT 1. ACTUATOR 3 FAILURE 

STRUT 3. FIRST B0D1KO MOOS FWQUBtCT BMOR 



Figure 2. Simulation of the System Diagnostic Monitor 
(Actuator 2 Failing off at 1 Sec) 

Figure 2 shows a dynamic system modeled with this type of filter or monitor 
designed to monitor two events: a failure in actuator number 2 (there are four 

actuators in this model); and a mismatch in the frequency of the first bending mode. 
The graph is a time history of two output indicators for the two events. Initially, 
everything is nominal, and the outputs are indistinguishable from zero. But after one 
second, we simulate a failure in actuator 2. At that point, the output indicator of the 
first event bocomes non-zero, while the output indicator of the second event, which is 
related to the frequency, still remains zero. If the system were configured to monitor 
these two events, you could clearly say what was wrong. The other case would be if the 
value model of that first bending mode frequency were wrong, as is shown in Figure 3. 
In that case, the monitoring system is being excited right from the start. The second 
event indicator is nonzero, where as the first-event indicator, corresponding to the 
actuator failure, stays zero. Again in this case, the monitor would be helpful in 
discovering what had gone wrong. I do not claim that this is a closed book. In 
particular, the effect of modeling inaccuracy on this kind of a monitor has not been 
resolved and is very important. We have made other runs for the case where some 
additional bending modes, beyond those that were involved in the design of the filter, 
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were simulated. That thoroughly confuses the monitor because of the leakage of the 
unmodeled dynamics through the filter to the indicator outputs. 

In conclusion, the management of uncertainty is certainly nothing new in control 
system work. Basically, controlling uncertainties is a fundamental reason for feedback 
in controllers. The large space structure problem is especially sensitive to some of 
these uncertainties. It is especially susceptible to component failures. Now, whether 
we like it or not, the problem of trying to detect component failures is coupled with the 
problems of modeling inaccuracies or model error parameter errors. I would suggest 
that if the coupling cannot be used to our advantage, then at least it could be used in 
monitoring of the health of the system. 


EVENT 1. ACTUATOR 2 FAILURE 

EVERT 2. FIRST BENDING NODE FREQUENCY ERROR 



Figure 3. Simulation of the System Diagnostic Monitor 
(First Bending Mode Frequency 10 Percent Low) 
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A. V. BALAKRISHNAN: Some Estimation and Identification Proble m* in 
Pie Ids In this lecture, the usefulness of random field theory in the estimation and 
control of large space structures is outlined. Random field theory can be used to 
characterize the deformation of surfaces of antennas or of other flexible structures. 
Professor Balakrishnan has developed a filtering technique for random fields which 
processes data to obtain estimates of the shape and model parameters. Most of the 
current filtering techniques exploit time dependency and the rational power density 
spectra of the signal. Random fields often generate a nonrational power spectral 
density of the signal and they do not have a time parameter. They have 2 or more 
parameters. Therefore, current filtering techniques cannot be used. Batch estimation 
is cumbersome to apply because it requires all the data to be processed at the same 
time. Hence, scanning schemes are superior. In two-dimensions, if the data is scanned 
in certain directions, the problem is converted to a one dimensional problem, and 
Kalman filtering can be applied. Professor Balakrishnan has suggested an improved 
method. Instead of considering one scan line at a time, consider many scan lines 
simultaneously. Thus, in a direction orthogonal to the scan line, a vector can be 
constructed consisting of states at points on the scanring lines. Thus, the problem is 
converted into a vector filtering problem. If the spectral model satifies certain 
sufficiency conditions, infinite-dimensional Kalman filters can be used. Necessary 
conditions are not known to date. However, the resulting Kalman filters can be used to 
handle two dimensional data very efficiently. 
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R. E. SKELTON: Model Error Structure and the Inseparability of the Control and 
Identification Problems This morning I would like to share with you my views on some 
of the problems we all face in the modeling and control of large space :;tructures. 
Specifically, we will be reminded, in several different ways, that the modeling and 
control problems are not independent, and that this fact has consequences la the field 
of identification, a maor subject of this workshop. 

A Simple Experiment 

A graphic demonstration of the inseparability of the modeling and control problem 
is presented in [1]. The essence of this experiment is as follows. Let Sj denote a model 

of the system S, but S^ * S due to modeling errors. Let be another model of the 

system S, but also S^ * S. In the experiment of [1], S was represented as the first 44 

elastic modes of a flexible spacecraft, S 1 w«.s one subset of these modes {1, 2, S, 9, 

17), and was another subset of these modes {1, 2, 17, 4 10}. Now, let denote the 

controller which is optimal with respect to the model S^, and let S c2 denote the 

controller which is optimal with respect to model S^. Let the performance of these two 

controllers be evaluated when driving the "real" system S. Which model should be 
better for control design or 5^7 Fig. 1 illustrates that the answer is that neither 

model is always best. For a particular range of control gains is best, and for another 

range S is best. In other words, one cannot say which model is best independently of 

some statement of the control law. This reinforces the notion that " one cannot say 
what is a good model without saving (precisely) what the model is going to be used for ". 




Figure 1. Any given model not always best for control design 
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llow, if there exists a model for which the control design c&nnot be improved 

(Fig. 2), then the modeling end control problem can be called separable with respect to 
that model S^. Of course, such notions involve all possible (infinite in number) models, 

and the question mates sense only with respect to a given control design methodology 
(LQG in the case of Fig. 1). 



Figure 2. Modeling and control problems separable with respect to S^. 

Nonetheless, this inseparability of the modeling and control problems holds without 
regard to the method of producing the model, whether that method be modeling from 
first principles (known physical laws, F = MA, etc.), or whether that method be 
modeling from on-line data, commonly called the identification problem. Our 
experience at Purdue has always produced the inseparability described in Fig. 1 and 
never the separability phenomenon of Fig. 2. 

The theory of Root Locus requires the model to be the same for all control gains. 
Hence, the Root Locus method seems useful only when the modeling and control 
problems are separable. In the original Root Locus work of Evans (2), he imagined that 
the most appropriate representation of the plant model remained the same for all 
controls (as it turns out, this is equivalent to the assumption of no error in the plant 
representation). Since modeling error creates such havoc with our textbook theories, it 
is worthwhile to look more closely at the nature of modeling errors. 

Model Error Structure 


Considering only linear systems, let us label the state of our finite dimensional 
representation of the plant as and the interconnection with the remaining states as 

x^.. Hence, the system actually obeys equations (1) 
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but the representation available to the control designer is 

X R = Ax R + Bu y = Cx R (2) 

A r = A + AA, B r = B ♦ AB, C R = C + AC 

subject to parameter errors in AA, AB, AC. But, rewriting (1) using these definitions 


t 


a , . A i-1 A.Jt-a) . . .. „ 

e i ’ ! A RT A T • T A TK X R <0>d0 * ^R 

o 

(3) 

t 

f. = / A RT A^. _1 e A T <t ~ a) B^.u(a)do = Qu 

(4) 

e t = w R +A RT [e^T 1 x^.(0) + / e w^,(a)da] 

o 

(5) 

allows x^. to be eliminated in (1), yielding, 


X R = A R X R + B R U + *t (tl + e l (x R > + f l (u) 

(6) 

’ = C R X R 

(7) 

e i “ P i x R + *i+l 1 “ 1 P i * A RT A T ^TR 

(8) 

“• °i - a rt4"\ 

(9) 


where we have fixed the coordinates in (1) so that = 0. Such coordinates always 
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exist if dim y S dim Xg. It is also possible to imagine coordinates for which = 0, 

C T * 0. In this case the error term of the sort (c t + e^ + f^) appears in the output eq. 

(7) instead of the state eq. (6). But, these variations do not alter the story, so I shall 
stick to the form (6) - (9), which is now a representation of the exact model error 
system . Comparing (2) and (6) - (7), it is clear that any model (2) which we may write 
for a linear system always has an associated model error vector of the form e = e^(t) 4 

e^(Xg) + f^u) + e^, which is the stun of four types of modeling errors: e^t) * errors 

which are functions of time (we usually call these external and internal disturbances), 
e^(Xg) = errors in model order (an integral operator P on Xg), f^(u) = errors in model 

order (an integral operator Q on u), e^(AA, AB) = errors in parameters = (AAXg 4 

ABu). Note especially that due to f ^(u), the impact of the model error e = e^ + e^ + f^ 

4 e^ cannot be determin 1 independently of the con t rol input u. The integral operator 

fj = Qu is an explanation of the inseparability 6f the modeling and control problem. 

Model reduction theories concern themselves with the term e^4 f^. Disturbance 

accommodation techniques concern themselves with e^. There is no control theory 

which promises the simultaneous accommodation of all four types of modeling error. 


More importantly, note that the phrase "parameter error" has no precise meaning, 
since the submatrix Ag * A 4 AA is coordinate dependent (even though we have a 

specific A in mind) and, in fact, this coordinate choice affects all the four model error 
terms. Therefore, each term of the model error vector is nonunique. The challenge 
here is this: If we succeed (by choice of coordinates or by compensation) in reducing or 
eliminating one type of error, snot vsr type may get worse. 


In the future, I think we must find ways to estimate the entire model error vector 
and not just certain terms in it. Because the subject of model error estimation 
embraces a broader class of errors, it has no substantial progress yet to report. One 
attempt |3] writes the model error systems (6-9) in the form 

x = Ax 

(10) 


y = Cx 


where x = f T ,f T ,...,y T ). (The time dependent error e has for convenience 

K 1 

been written in the form e = P (t), where the equality is only in a mean-squared sense 

t Y 

T 

and holds when the independent basis functions y (t) = (y^Oor-lt),...) form a complete 
set.) The matrix A has a specific form dictated by the aggregation of equations (6-9) to 
form (10). The model error vector is estimated if x is, since (Xg, f^, y^) » x^C j 
for some The parameter A and the state x can be estimated under certain 

conditions [3]. 
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The dynamical system described by 


A A A A 

x = Ax + F(y - Cx) 


a -IT a ',T 

A = 0 C i Q(y - C&x 1 


drives this Liapunov function downhill (V & 0), 


V = ||y - Cx ||p + || A - A||q 


(il) 


( 12 ) 


provided 

C T QC(A - PC) + (A - FC) T C T QC + L T L » 0 (13) 

is satisfied for some L. 


The good news is that estimators of the form (11). when used in the above spirit, 
embrace a broader class of model errors. The bad news is that condition (13) is 
unverifiable even when it is satisfied (A is unknown). Partitioned parts of (13) yield 
certain specialized results, reported in [4]. Most adaptive control and estimation 
algorithms can be explained in terms of simplications of problem (11-13). Adaptive 
control ignores (e^ L, y) in the x of (10). Orthogonal filters [4] ignore (e^ f^) in the x 

of ( 10 ). 


A more direct approach to model error estimation was posed by Rodriguez [5,6]. 
By lumping e. 4 f 4 e into one term e » e. + f. 4 e , and allowing error terms also in 

1 X t X 1 1 L 

the output equation, y * Cx_ 4 e , the model error system (6-9) can be written 

K y 


4 B “ * e x 


y = Cx n 4 e 
’ R y 


(14) 


or in operator notation (assuming zero initial conditions) 
y = K^u 4 K 2 e x 4 e y = KjU 4 [Kjl] 
where the operators are defined by 



(15) 


K.u * J Ce 
1 o 


A(t-a) 


Bu(a)da 


( 16 ) 


K 0 e = / Ce A(l ' a) e (o)da 
2 x x 


o 
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Hence (15) may be written 


y = y- K^Ke , K = [K^. I] 


T A T T. 
e a (e e ) 
x y 


and has the minimum norm solution 
e = K*y. 


(17) 


(18) 


The good news is that the model error estimation (18) requires no decomposition of the 
model error vector as in (6). (This is an advantage due to the nonuniqueness of the 
decomposition.) The bad tews is that the pseudo inverse of an integral operator K* is 
required, and this is net an on-line calculation. Some numerical examples are given in 
[6J. 


Identifying Structures Under Control 

We now look at the parameter identification problem as a modeling method. 
Having already reminded ourselves that the modeling and control problems are not 
independent, we certainly expect that the identified model will be dependent upon the 
control inputs. Suppose that the closed loop system of Fig. S is "identified" as G'(s). 
rhen, since the controller H(s) is known, the identified plant is recovered from 
knowledge of G'(s) and H(s), 


G’(s) 

G(s) = (19) 

1 - G’(s)H(s) 

Now, the interesting obs f ation here is the "identified plant" G*(s) is a function of the 
controller H(s). Hence, it the controller H(s) is changed, the plant as the controller 3ecs 
it is different. The fact that the plant looks different with each controller (hence with 
and without control) has not penetrated the identification research, judging from a 
sparsity of papers on the subject of identification under feedback. 
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Thla iyitM la "ldantlflad" »a G*( a ) 


Pigur* 3. Identifying structures under control 
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Since the identified closed loop system G’(s) is obviously a function of the feedback 
law H(s), let us write G'(H(s),s). Then, it is clear that the iden Ification and control 
problems are separable iff G’(s) is invariant under H(s); that is. iff 

G(s) 

G'(s) 

1 + G(s)H(s) 

But this can happen only if there are no modeling errors (G'(s) is exactly correct). 
Hence, the inevitable modeling errors force the inseparability of the modeling and 
control problems. 

The conclusion here is that it is very difficult to define a meaningful identification 
experiment, one that provides improved knowledge with which to design a control lrw. 
One logical approach to this dilemma is to reject any identif '.cation result which does 
not produce the correct H(s). That is, apply an identification method to the closed loop 
system treating both plant and controller as unknowns. Then reject the "identified" 
plant unless the "identified" controller matches the known controller This is not a 
sufficient condition for a successful identification experiment, however. The problems 
involved in such a unified identification plan include: 

1. One identification approach may satisfy these conditions and another may not, 
even though both methods may have convergence "proofs" (which are oased upon 
the assumption that only parameter errors are present in the model error vector). 

2. Even after identification, the math'matical techniques used to separate the 
identification of the plant and controller subsystems will affect the results and 
conclusions. 

3. Convergence criteria need to be established to determine when identification under 
feedback is successful. 

Conc lusions 

Both dynamics and control textbooks are written as though the modeling problem 
and the control problem are separable. However, due to inevitable modeling errors, 
they are not separable. Moreover, since the modeling and control problems are not 
separable, neither are the identification and control problems. Research is needed to 
identify structures under feedback control. Otherwise, it is not clear that open loop 
identification experiments will be useful for providing models for control design . R. A. 
Frosch put it well this morning in his ACC plenary talk when he said, "We are not sure 
of what we are doing when we abstract the real world". 
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R. F. CARLISLE: Future Research Technology Directions for Space Station* What we 
are attempting to do, using the available literature and expertise, is to get a solution 
that can optimize the station by further application of controls. In the world of 
spacecraft systems, when the power subsystem fellows tell me to bring on-line 
improvements in power, I tell them to forget it. This is an expense that I do not think 
the system can afford. We will go back to the user and reduce the power requirements 
by applying other techniques. We are dealing with trades in the integration of 
subsystems. Thus, when a control system designer says I cannot solve your problem. I 
go back to structures. We just have to beef it up and fly it anyway. When I cannot 
afford the structure and the weight to do that, then I go back to the cortrol people and 
say how can you help me. That is the world that we are dealing with. 

On the space station, as you know, we have a planned approval for the program. 
We are all anxiously waiting for the approved budget this year. If successful this year, 
we are looking for a flight date approximately in the 1992 time frame. In the world of 
research and technology and advanced development and design, that means we can 
expedite some promising hings in the pipeline. This will be done by increasing funding 
for certain technologies and bringing them to a state of maturity and readiness that 
perhaps can improve the initial space station design. Let me share with you some of 
the things that I see in the design profile. 

First, we have a ground rule that says the space station is going to be an 
evolutionary design. That means it is going to grow and change with time. There will 
be block change: in configuration, and maybe block changes in performance. Therefore, 
it behooves us to do the best we can in the initial design, so that we can evolve 
gracefully and economically to meet those challenges. If we lock ourselves in with a 
design that does not have flexibility and growth potential, then the control system may 
be one of the things that bounds the extent of evolution that we could hope to achieve. 
AU of the sub-systems are trying to bring robustness, flexibility, and evolution of 
growth, so that they are not the ones to bound the potential for space station 
evolution. Another interesting thing in space, as opposed to aeronautics, is that the 
space station is essentially the first time (really the shuttle was) we can repair 
on-orbit. We are asking for 20 years continuous life with on-orbit maintenance. That 
gets into the practical world of hardware and failure analysis. It also gets into the 
question of how we accept failures in the most economical way, and whether we want 
to have built-in redundancy and fault tolerance or repair on-orbit. Repairing on-orbit 
means we need tests and analysis in order to locate failures, and a means of 
accomplishing the repairs. Thus, we are looking at extremely interesting challenges 
with the space station. Essentially, we have 5 years to mature promising technologv 
and applications. But in an evolutionary sense, promising research that could be made 
into a block change is also an opportunity. Overall, it is a world of opportunity that I do 
not think I have ever seen before. Thus, I want to illustrate just a little more what I 
t hink the problem is, and review with you what 1 understand of where NASA has gone 
with their funding that is applicable to space station. Finally, want to leave with you 
our definition of what we are looking for that is in the spirit of pportunities for future 
research. 

♦The speaker refers to slides used in the oral presentation that were not available ror 
inclusion in this written report; however, the major points are evident even without the 
slides 
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If this were our initial space station-and it is not. it is just typical-each one of 
these would be a payload flight for the shuttle. So you place one of these in orbit and 
leave it there, and then you bring the shuttle up and bring another module to it. Now, I 
think this is fairly complex for the initial operating configuration, but let us just assume 
that for now and walk through the control problems. You have to be able to stabilize 
something like this. You have to be able to hold it well enough in attitude so you can 
mate to it, and mate to it with only the resolution of what the shuttle can do, or what 
this can do. As that changes, there are obviously major shifts in mass distribution. 
Maybe two modules hooked together can still be considered a rigid body. But, as it 
grows in size, as the whole structure gets larger, then you begin to wonder what the 
relative rigidity of these joints is. The initial space station has a requirement for 75 kw 
useful on-board power. That is a solar array of about 20,000 square feet, half an acre 
and about 20,000 pounds. So, it is not trivial. There are significant distributed 
low-frequency appendages on this structure. This is the radiator, which dumps the 
waste heat. Because of the efficiency of a closed cycle like this, there is almost 75 kw 
being radiated out. So, this is not a trivial appendage either. 

This is another illustration of a larger space station. Now look what we have done 
to ourselves. If these were rigid bodies with classical control, and then somehow this 
whole matrix grew to this configuration with these appendages, then I wonder if that 
original classical control design would stand some graceful economic evolution to 
control that cluster as it is shown there. I would say that is reasonably realistic as to 
what the evolution would be. Going a bit funher, this is a collection of conceptual 
drawings of what it may grow to eventually. I cannot define anyone of those as being 
realistic, but any of them could be. We would like to drive in that direction. The 
challenge to you is that control systems of the original configuration should not 
constrain us, if that is a reasonable request. I do not know whether it is or not. 

Let me go back then to a very quick review of where we have been in NASA 
funding for control of large flexible structures. In about 1978, we started working on a 
large structures program. Initially, we homed in on antennas. These are two antenna 
configurations: the hoop column and the wrap rib, representing Harris Corporation and 
Lockheed designs. In the control %rea, we dealt with the problem of maintaining the 
relationship between critical points of the feed and the reflector. We also dealt with 
the problem of surface control. This program started as a structures program. As we 
got into the program, we recognized that we were not dealing with a system unless we 
addressed the control problem and the RF characteristics. We had a ground test 
program that built up full-size segments of these antennas. Here is a section of four 
gores of the hoop column. This shows it built up on the ground and spreading the mesh. 
These are small reflectors that are out on the mesh for lining up the mesh. They can 
also be used to close the loop around to measure and control the surface conditions. 
This is a full scale model of the wrap rib, with the mesh deployed. The ribs cannot 
support themselves in one G. They are hung from spring assemblies attached to the 
ceiling and to the floor. We hoped when we started that we could measure some 
dynamic characteristics. In retrospect, I have to say that it was successful research 
since we learned that we failed. We could not get significant resolution of any 
importance to measure dynamics. We did not spend enough money or time to get the 
precision of realistic measurements for dynamics. This led us to conclude that since we 
did not know what to do on the ground anymore, we were going to have to fly. We 
proposed a flight program planning exercise in-house. We recognized the expense in 
undertaking such ambitious planning, but we felt it had to be done. The 
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plan grew to the point that we killed it with economics. We ended up with an in-house 
estimate of approximately 350 million dollars to run perhaps a single flight that would 
only learn something about control dynamics. Figuring the system would not stand that 
much expense, we decided try to get a payload to share expenses with us. We couldn't 
find a payload willing to take the risk of committing to an unproven structure. Since 
nobody wanted to go with us, and we could not afford it alone, we stopped that program 
and faced the fact that we do not know how to go full size in flight. Wc are still 
working on the problem. I think, and several others agree, that we are developing 
powerful technology. We want to achieve that plateau of technology where we can 
handle control of low frequency interactive modes. We are now planning an exhaustive 
ground test on a much smaller article. The question is how do we proceed to validate 
that design. Can we get sufficient measurements on the ground, with instrumentation 
and fixturing errors, to have the confidence to fly full scale, larger vehicles? Or do we 
need a flight test to validate the ground test? If we have to have a flight test, how do 
we scale? If we use a small ground sample and fly that same sample, to get the 
relationship between the on-orbit and ground characteristics, do we gain enough 
confidence to fly full-size structures? Those are the things that we are really worrying 
about. In the current planning of the space station, there are several configurations. 
The real configuration has not been decided. The eight contractors that have been 
working with us all recognize that controllability is a critical issue for confidence in the 
original design. This configuration, called the delta, has a rigid structure. It is an 
inherently stiff design, with the modules lined up at the ends of the triangle of a 
pyramid. It is the most rigid design. It is also probably the one with the least growth 
potential. These two are quite similar, with the modules gathered in a fairly stiff 
arrangement and with a flexible appendage. This design offers the opportunity and 
advantage that the shuttle can come in and berth to it away from the solar arrays; but 
it can, if it grows in evolution, significantly change and complicate the control 
problem. The question that I pose is: how can you help us with the tough decisions that 
we have to make in the next five years? What is the original control configuration that 
can meet most of the desirable features that I mentioned earlier? Our problem may be 
summarized as evolutionary growth, size and complexity, possible need for on-orbit 
test, and t long-life requirement. We have the opportunity for solving it with control 
technology or with structures technology or by some combination of each. No matter 
what that combination is, we still have challenge of how the initial design should give us 
the most options. 
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H. A. RBD1BSS: The Role of Rypariments in the Development of Control Technology* 
Experiments, both in Uooratcries on the ground and flight experiment* on-orbit, are 
necessary in the development and validation of controls technology for large flexible 
space structures. I was pleased to see that it was recognized in this workshop, and that 
there were so many papers that touched on experiments. 1 have not prepared a formal 
paper on this subject, and for the most part I am singing to the chcir. However, there 
are three points that I would like to discuss: 

• Why experiments are important for developing the technoiog 

• The need for a coordinated program to validate techniques from analysis to 
simple experiments to complex experiments. 

• The need for a cooperative program among government, university and 
industry to make the most effective use of experimental data. 

Why Experiments are Important . We are beginning a new era with large space systems 
that will tax our current methods of developing spacecraft. We are considering space 
systems that are sufficiently large that they must be developed in space in sections and 
assembled. The systems will be designed for the zero-g vacuum environment of space, 
and we will not be able to fully assemble and test the entire system on the ground 
before deployment. We will not be able to do complete ground vibration testing. 

Ground vibration testing has been an important aspect of developing control 
systems for aerospace vehicles where there are significant structures and controls 
interactions. We currently do the best job we can in modeling the structural dynamics 
of a vehicle, but we have an opportunity to fine-tune our models and, in turn, the 
control systems with ground vibration data. 

If we cannot test the complete space system, we need to have a higher degree of 
confidence in our modeling and design methods than we currently have. We may have 
to do on-orbit systems identification of the complete space system, or even as it is 
being constructed in the case of a space station, before the attitude control system can 
be operated. We must have confidence in our systems identification methods before we 
get to operational systems. 

There are other factors that further complicate the problem of having confidence 
in our analysis and design methods. Three significant ones are: 

1. Structural modes will be overlapping the required controller bandwidth. 

2. Uncertainties in predicting damping. 

3. Introduction of new composite materials. 

Figure 1 points out the trend we expect to see in the control and structure 
interactions of future spacecraft, as they get much larger and hence more flexible. In 
today's spacecraft, the structural modes are sufficiently separated from the controller 
bandwidth that relatively simple filtering is adequate to deal with the interaction. In 
future flexible spacecraft, the structural modes are expected to overlap the required 

♦Several of the charts used by H. A. Rediess are from a NASA presentation on Con rr\ 
of Flexible Structures and were used with NASA permission. 
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controller bandwidth. Requirements for rapid slewing, precise pointing, and very 
accurate shape control will force the designer to consider the control and structure 
interactions. Active structural mode suppression may be necessary to achieve effective 
and affordable designs. We need a higher degree of confidence in our structural 
modeling and control design techniq for active controls. 




Figure 1. Trends in spacecraft control and structures interactions 
(From R. A. Russell of NASA Headquarters). 

The second factor that complicates the problem is the uncertainty in predicting the 
inherent damping of space structures. Professor Vender Velde mentioned earlier that 
damping characteristics are probably the most difficult to predict and yet are very 
important in control design problems. Damping characteristics of complicated 
structures, such as a large tetrahedral truss antenna with a long offset feed, can be 
very different in the space environment than on the ground. Figure 2 shows 
experimental data on simple- joint damping obtained by Professor Mead of South 
Hampton University. Test results are shown for a no-joint specimen and for a joint 
specimen in air and in a vacuum. Note that joint damping in vacuum is about cne-third 
that in air. These data are for one simple type of joint, for one type of material 
(aluminum), and for the fundamental vibration mode in oscillating tension and 
compression. Considering the multitude of types of joints, materials, and vibration 
modes, one can see the difficulty in predicting damping of large complex space 
structures. There is a need for a series of experiments on small and full scale 
specimens. If possible, these experiments should be conducted in the space environment 
to develop an adequate data base and prediction techniques. 

The third factor is due to the introduction of new composite materials, such as 
metal matrix composites (MMC). MMC have several characteristics which make them 
particularly well suited for spacecraft applications. They have a high strength to 
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Figure 2. Joint damping data for oscillating tension/compression tests 
(From Prof. Mead of South Hampton University). 

weight ratio, excellent dimensional stability, and low coefficients of expansion. 
Certain important characteristics, such as stiffness and damping, can be changed and 
tailored by the way the MMC is made. The effect of material characteristics could 
have on spacecraft performance is illustrated in Figure 3. It depicts the elements of a 
study being conducted by HR Textron for the Naval Sea Systems Command. The 
spacecraft is a hypothetical surveillance system that is required to maneuver for 
survivability and yet meet stringent pointing and shape control requirements even while 
maneuvering. The chart in the center illustrates the effect of one material parameter, 
stiffness, on line-of-sight error at various maneuver acceleration levels. It is necessary 
to characterize these new materials in laboratory tests in order to assess the effects on 
spacecraft performance. That would, in turn, improve the ability to predict the 
structural dynamics characteristics. Control and structural interactions will become 
more significant in future large space structures. The ability to predict accurately the 
damping characteristics is important for effective controller design. Characterization 
of MMC properties could lead to optimization of new materials, structures, and controls 
for flexible space structures. These and other factors will require a var y of 
experiments in laboratories in space to develop the technology with confidence. 

Need for a Coordinated Program The second major point I wanted to make was the 
need for a well coordinated program to validate techniques from analysis to simple 
experiments to complex experiments. The final report of the NASA Space Systems 
Technology Advisory Committee, ad hoc Subcommittee on Controls/Structures 
Interactions, dated Tune 8, 1983, strongly supported the need for ground and on-orbit 
testing for developing control/structures interaction (CSI) technology. Figure 4 is from 
that report and shows the ad hoc Subcommittee's recommendation for a coordinated 
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Figui e 3. Effects of material stiffness on spacecraft performance. 


experimeat*! program to develop and validate the technolsgt. It also suggests a time 
frame' for ' virion to reach a high- c oaf Meace Level by the early 

1990*8. The process is iterative and invclv* technique development, ground testing, 
and on-orbit testing. An important part c on-orbit testing is the valMation of 
groand- testing techniques. The report suggests a major controi/stractnres interaction 
flight demonstration to validate the techniques once the analysis tools and ground 
testing procedures are refined. The report indicated that such a flight demonstration 
could be conducted in the early 1990*8. That is the type of coordinated analysis, 
ground, and flight test program that 1 also believe is needed to adequately validate the 
technology. 

Figure 5 shows one element of NASA's planned program for developing this 
technology for tzuiiUbody configurations with flexible appendage*. The approach is to 
start with a relatively simple- configuration and to progress to more complicated and 
realistic ones as the techniques are refined. Even though these are ground tests, they 
can be very expensive, A well coordinated program is seeded to make it cost effective. 


Need for a cooperative government, university, industry data base program . My third 
and last point is on the need for a cooperative data base among NASA, Air Force, 
industry, and mmenitim to make the most effective data. Figure 

g shows the major test elements planned in NASA's control of flexible structures 
program. It is envisioned that these will include exteasive and. costly... eapeiime«ts..:p& 
tjbe grotmd and "on -orbit." ' extensive ground and flight 

test activities. The nation will only be able to afford a limited number of these test 
programs. Therefore, It is vital that the data be made available to a broad 
cross- section of the controls technology community, I recommend that NASA 
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Figure 4, Sole of ground and on-orbit testing in c ontrol/ structure 
interaction technology development (From final report of the 
NASA ad hoc Advisory Subcommittee on Control/ Structure interactions). 
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Figure S. Examples of multibodf dummies models considered in M ASA's 
control of fl«»We . structures- program (From 1 , H. t-imett-of N ASA- ■Headquarter^'): 








and the Air Force develop a national cooperative data base for these major ground and 
on-orbit experiments. The controls/stroctures technical community - including 
universities, industry, and government laboratories - should be involved in establishing 
the experiment and data requirements. A data base should be established so that many 
of the different techniques for modeling and controlling of structures could be tested 
and validated. There must have been at least SO different ones discussed at this 
workshop alone. The data base should be devised in such a manner that it can be easily 
accessed and used without extra programming by each user. It should be available at a 
low enough cost that universities could use the data in relatively small research grants. 
If these requirements are considered and included at the outset, there should be 
minimal impact on cost and schedule of the programs. 



Figure 6. Test model descriptions for NASA's control of flexible 
structures program (From R. H. Russell of NASA Headquarters). 

It will not be possible to rerun these experiments with each new analysis technique 
or c^ntrv’ concept. It also gets very expensive to re-ana tyze experimental data with a 
second o. third researcher, if the data are not already in a good user format. With 
rppropri -~e pre-planning and setting up of a multi-user data base, the cost for 
r.dditiouil analysis by alternate researchers should be quite small. 

In conclusion, my three main points were that: ground and flight test experiments 
will be necessary to have high confidence in our systems identification and 
controls/structures interaction design techniques; a well coordinated program is needed 
that involves analysis, ground testing and flight testing; and, lastly, an interagency 
cooperative data base should be set up for all major ground and flight test programs to 
achieve the most effective use of the very rare experimental data. I would like to 
commend NASA on constructing an excellent program plan for developing and 
validating the technology for control of flexible structures as presented in Mr. Russell's 
paper. 

EDITOR'S NOTE: This concludes the formal remarks made by the panel members. 
There follows a discussion period between the panel members and audience participants. 
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R. V. RAMNATH: I would like to share with you some of my experiences, interests, and 
aspirations in the area of asymptotic analysis. Asymptotic methods have been used 
with great success in many branches of the physical sciences, mathematics, physics, 
celestial mechanics, mechanics, etc. It is my feeling that asymptotic methods have not 
been used as well as they should have been in the areas of large space structures, 
structural dynamics, and control. There have been a sprinkling of papers using 
perturbation methods and asymptotic analysis in the last ten or twenty years. Professor 
Lions' paper highlights some of the issues. We have been working in this area for 
sometime. I would like to tell you about our experience. 

Asymptotic Analysis . We all agree that real physical problems are very hard to pose 
exactly, and, even if they can be posed, they will result in highly nonlinear coupled 
equations, for which closed form analytical solutions are hard to obtain. For example, 
large flexible space structures are described by partial differential equations with 
variable parameters. Even worse, these parameters may be nonlinear functions. For 
such problems, it is very hard to obtain closed form solutions. So what we do is try to 
get some kind of an approximation. Historically, approximations have been tried as 
long as people have been analyzing real problems. Asymptotic analysis is one way of 
approximating the problems, and we have used it fairly successfully. 

I would like to define asymptotic analysis as a study of applied mathematical 
systems in limiting cases. There seems to be a little bit of confusion in the literature 
about the convergence and the asymptoticity of equations. These two words are used 
interchangeably. We can discuss the difference in the context of a simple example. 
Consider a slowly decaying exponential function exp (-at), where a is a small 
parameter. The series representation of this function is convergent within an infinite 
radius of convergence. But, in order to get the approximation by truncating the series, 
one has to be careful. If the argument is very large, while the series converges, it 
converges very, very slowly. Thus, if one takes a few terms to represent the series, one 
gets very poor results. Thus, this is an example of a convergent series that is not 
asymptotic in certain domains. Also, there are other functions which diverge 
everywhere, but for certain values are known to be asymptotic. 

Asymptotic analysis is considered as an art and science by practitioners. The 
science is what mathematicians worry about. For example, they worry about how to 
calculate these functions, etc. On the other hand, engineers are faced with unknown 
parameters, so that there is some kind of Judgement required. This is the art side of 
the problem. 

Asymptotic analysis can be used in simplifying complicated problems. The 
principle of simplification states that in the asymptotic limit the system simplifies. 
Professor Lions talks about the need for more asymptotic analysis in his paper, mainly 
because it leads to simple descriptions of very complicated systems. 

The principle of minimum simplification or maximum balance states that there 
exists a representation of the problem which retains most of the prominent features of 
the problem, while it is simple enough to obtain a closed form solution to it. The real 
problem may not be amenable to a closed form solution. These two principles are not 
theorems but give a guideline for approximating problems which I have found very 
useful. 
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Simplicity is obtained basically by neglect of terms im the mathematical model, 
but if one throws away too many terms, the problem posed may be useless. Thus, an 
optimum has to be found. I have found that this principle of minimal simplification or 
m a x i m al balance is very useful in this regard. There are a lot of ways of simplifying a 
system, but a particular approach that I like is that of asymptotically ordering a 
complicated system. Asymptotic ordering puts a complicated system into a 
hierarchical structure, where you start with the simplest system and add on to i'; 
another sub-system which is a little bit more complicated. Thus, one achieves some 
kind of ordering and a cascade of systems of greater and greater complexity. Then one 
stops when the systems become too complicated to solve. 

We have applied this to model the space shuttle. It has a hO-state differential 
equation describing rigid body motion, flexible body motion, control actuator dynamics, 
etc. The hierarchical structure could be obtained because we could differentiate by 
time scaling. This approach is called the multiple scale approach. 

Multiple Scale Approach. This approach consists of taking an independent variable in 
the differential equation and extending, its domain into a space of higher dimension. 
When the problem is restated in this new space, the problem gets simplified 
considerably. 

For example, a dynamic phenomenon wbich exbibits qualitatively different 
behavior, such as fast motion combined with slow motion, can be converted into two 
problems. The independent variable, time, will have two scales, one for the fast motion 
and the other for the slow motion. With these two new independent variables, two 
partial differential equations can be written in the place of one differential equation. 

Physically, it is of interest to think of the following experiment. Imagine there are 
observers with clocks observing a phenomenon. Bach clock counts time on a different 
scale. Thus, each observer observes only certain aspects of the phenomenon which may 
be different from the others. The observer with the fast clock observes the fast 
changes of tbe dynamics while the observer with the slow clock observes only slow 
changes. Thus, it is possible to separate all the motions and then combine them 
together to get a composite type of representation. Now, one solves this new problem 
in higher dimensions asymptotically and requires tbe solution to match with the solution 
of the original problem along certain lines wbich are called trajectories. Thus, we are 
able to get back to the original problem. 

Application s. The multiple scale approach was used to solve the problem of 
electrostatically controlling the membrane mirror of a space antenna wbich would be 
used in radio astronomy experiments. A membrane made of Mylar is preferable because 
it is light and can be folded. Thus, it is convenient to carry in space. The shape of this 
membrane is to be controlled by an electrostatic field generated by electrodes. Even 
though the electrodes are finite, the electrostatic field is continuous. Thus, a 
continuous membrane is to be controlled by a discontinuous force. If the shape of the 
membrane is a perfect circle, the mode shapes are give by Bessel ..unctions. But when 
the shape is not circular, solving for the mode shapes is very difficult. But, our 
experience tells us that this is where art comes into play. The independent variable can 
be split into two variables that are nonlinear functions. Then, a solution in terms of 
Bessel functions can be obtained, whose arguments are these nonlinear functions. Thus, 
one obtains an asymptotic solution for the mode shapes of tbe membrane. 
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The payoff of this asymptotic solution is that we can understand the dynamics of 
the model asymptotically. The error is not zero, but it is not large. We are able to 
think of a feedback solution for the control problem. The solutions have a Bessel 
function of a different kind with the feedback gain embedded in its arguments. Also, 
we can study its stability and controllability and come up with a constructive procedure 
to calculate feedback gains and stability. 

Conclusions : It is my feeling that more and more studies would indicate the usefulness 
of approximations using asymptotic methods for problems which are very difficult to 
solve analytically. The advantages of asymptotic methods are that one obtains 
analytical or quasi- analytical solutions to these problems. This helps in studying the 
properties of the system, such as stability and controllability. Also, the methods can be 
used to solve control problems. Bven though the errors are not zero, they are rather 
small compared to other methods. So I want to reiterate Professor Lions* suggestion of 
looking at asymptotic solutions of distributed parameter systems. 

QUESTION : There are a range of opinions on Space- Station controls, some of which say 
that it is just another Skylab. If that is true, why can't we merely use th v control laws 
from that program? 

RAMNATH : I want to paraphrase what R. A. Frosch said, "We are not sure of what we 
are doing when we abstract the real world." When we develop models for these things, 
we cannot know the importance of modeling errors until we fly. T erein lies the rub. It 
depends on the performance requirements of the Space Station. If the performance 
requirements are benign, we can use Skylab technology. If you are asking for 
complicated requirements, that require control systems bandwidth* less than the order 
of hours, then we should worry about flexibility. 

CARLISL E: I think it is reasonable to assume that experiments that require very 
precise pointing would have some control system Just as they did in Skylab. What 
concerns me the most is that we have to be able to hold a large station stable enough to 
rendezvous and dock with -virtually zero energy transfer. Then we want to be able to 
evolve. The evolution is "to be determined". We Just do not know what its nature is. 
The question is: what can we put in the first one as a control system architecture that 
will give us flexibility to evolve in the future? I think if we go with Skylab technology, 
the control system is going to bound the evolutionary growth premature *v. 

RBDiBSS : I think that the available controls technology has got to be self-fulfilling in 
terms of the type of space station one might have. For example, if we rely on the 
technology that presently exists, which was adequate for Skylab, we are going to be 
limited to a type of space station that is compatible with that or suffer very expensive 
modifications in space at some later time when we want to expand that capability. 

HCBLTON : It was mentioned earlier that we cannot afford to do full scale testing in 
space. What evidence do we have that we can afford not to — in other words, it may 
turn out to be less expensive in the long run. 

CARLISLE : The problem is: if we have to have full scale testing in space to validate 
new technology, and the system says we cannot afford it, then we are bound to live with 
something less than the potential technology that control analysis and research shows is 
very promising. 
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QUESTION : In various meetings in which I have participated recently , I bwe sensed an 
opinion that we have had a lot of effort on theoietical development directed toward 
large space structures. There seems to be an unjustified impsti ' with theoretical 
developments, and a concern that researchers are going off in tne wild blue yonder 
trying to address problems that may not exist. The impression is that it is more 
appropriate to develop an extensive experimental plan and defer further .vaphasis on 
theoretical activities. I would like to get the panel t.o comment cn this. 

CARLISLE : I think research is very promising. However, in the world of program 
management, unless we validate the research, it will never get used. The question is, 
how do you get your kicks? In pure research, you can get your kicks by doing analysis 
and identifying a promising application. Then, you get bored with it and wart to take 
on another new idea. If nobody else picks it up, develops feasibility, a.id validates your 
first idea, that idea will never get used. I think there is a necessity for all phases of 
research. I think it is terrific to do fundamental work. I think it more imporant that 
research be developed to the point that there is enough confidence in it to be used. 
That is a tough decision. I have had some experience with program management. I can 
tell you that, when I was running a major program, I was not interested in research 
unless I had a critical problem. When I had a critical problem, I needed all tue help 1 
could get. I would take as many chances as I had to, to meet my obligation as program 
manager. Unless I had that critical need, I did not have any incentive to add complexity 
in maturing an experimental technology. A program manager has no incentive to do 
that. It has to be done in the pre-development phase. This relates to my concern on 
Space Station. We have five years to mature promising ideas. I am not satisf i that 
we are paying enough attention to sort out which are the n ost promising view.. With 
an evolutionary approach to the Space Station, we should not gc with last year’s 
technology, we should go with next year's technology, with a plan to evolve to the 
future --do something even better. I feel quite strongly that there is a necessity for 
several views and that we need to coordinate all those in some complementary way. I 
also believe that there is a personality clash between the types of people with original 
ideas and those required to carry out the applications. We have to do something as a 
community to improve the communication and cooperation, so that we can sort out the 
really promising ideas and get them used. The more new ideas get used, the more the 
whole discipline will proceed. Soon the rest of the Space Station will be doing things 
with structures that we never did before, because the control system technology is 
needed. I do not think we are really up to speed now in that area, and I would like to 
encourage it. 

QUESTION : As program manager of a Space Station program, would you be satisfied to 
incorporate methodology in control systems or operations that you had not 
demonstrated in some kind of space experiment? 

CARLISLE : It depends. That is a point I tided to touch on earlier today. If the 
technology is too expensive to mature full-scale in orbit, then the problem is to gain 
confidence to use it the first time full scale. We flew the shuttle, and we did not do as 
much flight testing on the shuttle as we do on aircraft. When the first Shuttle flew, it 
was a brand new thing in space. We had enough maturity, confidence, and experience, 
so that was a good decision. 1 also think that the decision to use new technology varies 
with the experience of the decision maker. You think of your own field. If somebody 
suggests a new technology to you, you get comfortable based on your own experience. 
If somebody suggests something in a field foreign to you, and you are a good researcht* 
or engineer, you will have all kinds of doubts, without the experience to back them up, 
and you will never buy the proposal. 
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SKELTO N: I would just like to make & point About the problem with waiting until there 
is a need for a new method Vefor ' we start worrying about it. We do not have a 
continuity that serves as a basis for » continuous flow of ideas into the system. One of 
the things that strikes me as odd is that in this country, and in the western world, there 
is no institute of significant eize that focuses on control and system science issues, e,s 
there is in the Soviet Union. The Soviets have more Wan a dozen institutes, sponsored 
by the Academy of Sciences, each one employing 3 to 4 thousand engineers with 
advanced degrees. They have the responsibility to carry on research all the way from 
the very theoretical to the applied and experimental in system topics such as control, 
identification, artificial intelligence, and all of their interactions. It seems to me, if 
you look at our organization charts, we find guidance and control blocks somewhere 
along the line. That narrow view of it may have covered the subject 30 years ago. I do 
not think it does today nor it will tomorrow. I am wondering, where in the western 
world it should be covered, whether it should in the Air Force, in NASA, or somewhere 
else. I think everybody L. expecting the other guy to do it. There is no consistent 
substantial support of the size and focus required, to address t^e Lsues that we deal 
with. When we launched the S* ju V vehicle, we had inter' -tion problems between 
propulsion systems and control. The”, we had interaction problems between the 
structural dynamics and controls. We can name a number of instances and examples 
where we defined the control concept too narrowly We nought of it as a little black 
box to be added to the system after eveiything else was done. I think the controls 
community has a much broader view today than it did 10 vears ago. However, in terms 
of the organizations that sponsor research, there is still only a very small segment that 
understands this broader view. 

CARLISLE : What constructive actions are suggested? 

SKELTON : One concern is this. We should determine whether or not to create an 
institute on systems science or control syst :mx issues, or whatever you want to call it. 
There may be good reasons for not creating such an institute, but there should be a 
study to decide. The decision not to do it should be a conscious effect. Let us not lose 
the technological leadership by default. 

VANDBR VELDE : I would like to comment on your concern about combining control 
theory with other disciplines. We see this phenomenon at MIT (1 assume you do also at 
Purdue): the graduate students today very commonly study quite a good deal in control 
and structural dynamics. This interest has been stimulated by the problem of large 
spacecraft control, which they look upon as an interesting challenge. It was very 
uncommon a number of years ago for our students to study those two areas very much 

CARLISLE : I would like to comment that in the last four or five years, within the 
sphere of NASA we have made a significant change. Funding for controls has gone up 
by a factor of 10 or me .e. We have this visibility that was talked about -- tne 
necessity of addressing the problem of strictures and control dynamics interactions. 
We have visibility in the budget and are looking for further expansion. I think that we 
have a thing going now because of this recognition. This is a powerful tool and can 
make a big difference in Space Station. We should try to pay attention to it »nd take 
advent, ge of that exposure. 

SKELTON : 1 agree there is progress. 
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BALAKRISHNAN : I just want to nuke a comment on an earlier statement maintaining 
that structures students have discovered control theory. Actually, the number of 
students going into controls is much more "mailer than it used to be. At UCLA, many 
more are going into communications. I have a question regarding the evolutionary 
design concept. Is this something new for NASA, or has NASA had any expc dence doir r 
something like that before? What kind of comments from university, industry and 
scientific communities are taken before such decisions are made? Is there any formal 
mechanism, or does somebody just make the decision? 

CARLISLE : I think the opportunity for permanent presence in space just spontaneously 
developed the evolutionary idea among those of us who have worked on it. The only 
way to achieve permanent presence is by service and maintenance. The Shuttle gives us 
the opportunity to evolve. I think we are in the infancy of even understanding what we 
mean when we say that. I do not know of any formal way that we consulted with 
anybody. It was just handled spontaneously, with a very preliminary plan which could go 
in any direction. The system engineering and integration task on the Space Station has 
been planned to be done within the agency with contractor support. Though the 
program is very new, we have been planning technology for a couple of years. But we 
are really in an infancy state. I meant what I said. I do not think we know what we 
mean when we say evolutionary design, except that we know we do not want to be 
bounded with last year's technology. We want to open it up to take advantage of the 
Shuttle and logistics end have it grow. 

QUESTION : Is the project pretty definite? 

CARLISLE : It is definite that we did not yet win a vote in Congress, but we did win the 
vote of the President. In his speech of last January, it was one of the things that he 
quoted as br : ug one of the future commitment in space for this country. 

R BDIBSS : I would like to go back to the point R. E. Skelton raised about the institute in 
control. In part, it relates to my experience within NASA, particularly in the controls 
area. I think there is a long term fundamental outlook within NASA of not really having 
treated controls as a science and discipline. It has been treated more as an engineering 
area. It takes quite a bit of effort for an organization like NASA to turn that around. 
There are times where we have received substantial increases in support in the area. It 
is due largely to program managers such as R. S. Carlisle, who understands controls and 
its "'sal difficulties and concerns, and who provides personal support. When major 
projects come along that clearly require some new technology in controls, there is an 
opportunity to increase the support for developing the technology, particularly towards 
the more costly end of it. I think there still is a difficulty in really recognizing controls 
as an equal disciplinary area with some the more well established disciplines, which 
have had decades of support with NASA. There are a number of people who are aware 
of that within NASA. It is just a difficult institutional thing to turn around. 

QUESTION : Do you think it ’» a reflection of th. r evaluation of the relative 

importance of the subject? 

RBDIBSS : No, I think the problem is that it is often treated as an engineering problem 
which can be solved through a good engineering approach-as compared with developing 
the fundamental concepts, methods, and tools as a discipline, such as we do in 
aerodynamics, materials and other disciplines. 
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VAN DBR VELDE : In Aeronautics, we are all familiar with the term control- configured 
aircraft. In fact, NASA has done work in-house and has sponsored a certain amount of 
work in that area. You never hear the term control- configured spacecraft, even though 
you mentioned the requirement this morning for integrated design. It seems as if there 
may be perhaps more to be gained in the large spacecraft area through the integrated 
design of the configuration, assuming from the onset that tl ere will be some form of 
control. That would really be a control configured spacecraft. 

CARLISLE : I think aeronautics is ahead of space technology in the integration of 
structures and controls. 

SKELTON : I think the universities should take some blame for this problem, hi the 
past, students would take a dynamics course, and they were told: this is what >ou want 
to model, and this is how you model it. They take a controls course, and they hear: if 
that is a model of a system, then this is what you do with it. However, there is a big 
world in-between for which we are here assembled. I think the universities have to 
raise a new breed that understands integration of the disciplines. At Purdue, we are 
really concerned about the integration of disciplines, and we are now revising our 
curriculum accordingly. Structures and dynamics have had a ISO-year history of 
development of very sophisticated methods to answer specific needs. Even though 
controls is in its infancy (being only a 50-year old rJjject), it has grown very rapidly 
and has achieved some sophisticated level. On the other hand, the premises upon which 
each of these two disciplines are based fall apart when put together. I do not think it is 
possible to take formally trained structural designers and «sk them to understand 
enough about "controls" to do the job. Conversely, it is not possible to ask formally 
trained control engineers to learn enough about structures to do the job. I think you 
need the universities to take it upon themselves to merge those disciplines at the 
fundamental educational level. 

RED1ESS : I would like to come back to one other point. There was a presentation by R. 
A. Russell from NASA Headquarters on a program plan for the control of flexible 
structures. I do not know how many of you had an opportunity to hear him, but it is in 
part responding to a special ad hrc subcommittee that was established by NASA to look 
at the control of flexible apace structures. That particular subcommittee was a 
combination of controls and structure: experts. I personally feel they did a very good 
job of reviewing NASA's program and making some recommendations. I also find that 
the particuu plan put together in responding to that is an excellent plan. I have had 
involvement for two years in trying to put together these types of plans for NASA. 1 
think the people who did th j should be commended, because they present an approach 
that really couples the disciplines of controls and structures. Often, when we have 
tried to put together such a program, we ended up packaging in one plan, a controls 
program and a separate structural dynamics program. I think this plan brings the two 
disciplines together. 1 would like to encourage that NASA implement as much of it. as 
possible. I understand that it has been favorably received up to this point. I fully 
understand tbe problems of funding these programs, since they get to be extremely 
expensive. I would encourage them to proceed and accomplish as much as possible. 

CARLISLE : I do not tau ^ if you are aware, but that was one of two specific new 
thrusts for next year's budget. So, we in NASA are trying to proceed. I suggest that 
the universities lelp. If the government and universities can get together to recognize 
the problem, it might help. 
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REDIBSS : The plan proposed was indeed a combination of analytical and theoretical 
work up through significant ground and flight testing. I think it really is responsive to 
the community's needs for technology. 

BALAKRISHNAN : I just wanted to say that we must not forget the scale of things. In 
microelectronics, we have projects at the level of $20 million in the universities. As 
you mentioned, your NASA project is coming along. However, if each grotq> is given 
only $10K to carry on and participate, you will not get major results. That kind of tb*ng 
has to change if you want large scale involvement. 

SKELTON : I would like to turn the question around to the audience. I am on the 
Aeronautics and Space Engineering Board (ASES) of the National Research Council. A 
committee of the ASEB is looking at the relationships between faculty, industry, and 
universities. I would appreciate those of you who have specific concerns or ideas on 
how those relationships can be improved to drop me a note, or somehow register those 
thoughts, so that I might take them into account. The concern is that in the 
universities a lot of our potential graduate students are not going on to higher 
education. If you have concerns that would help tc broaden my own perspective in 
formulating ideas and getting points of view, I would certainly appreciate hearing them. 
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OF FLEXIBLE SPACE STRUCTURES 

H. A. Redicss and N. Nayak 
H. R. Textron, Inc. 

Irvine, CA 92714 


INTRODUCTION 


The main objectives of the workshop were: 

- to provide a forum for exchanging ideas and thoughts on how to effectively 
control large flexible multibody spacecraft; and 

- to identify the important unsolved problems of current and future interest 
leading to possible future collaborative NASA/University/Industry efforts. 

The workshop was organized with several sessions addressing the major technical 
and theoretical issues through invited and contributed papers, three panel discussions, 
and a final wrap-up session. The workshop agenda is presented in Appendix A. The list 
of attendees is presented in Appendix B. The purpose of this report is to present a 
technical evaluation of the workshop that synthesizes the most important results, 
conclusions, and recommendations for future research. 

The workshop covered a wide spectrum of issues involved in the identification and 
control of flexible space structures. Many different concepts, ideas, and novel solutions 
were presented in this workshop. A new trend in the workshop was visible. There was a 
consensus that an integrated approach be adopted to solve the complex and challenging 
problem of control of large flexible space structures. A me ' >g of control theory, 
structural design, and materials among others can produce * snt solution to the 
problem. 

Since NASA is planning for the deployment of large space structures, such as space 
station and antennas, it was felt that much of the theory that has been developed has to 
be validated by doing laboratory experiments both on the ground and onboard in space. 
Of course, the~e is a continuing need for more basic research in many areas to refine 
the ideas and concepts and to develop new solutions. 
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The recommendations outlined below are believed to represent a consensus view 
from the workshop and draw heavily on the results of the panel discussions and the 
papers presented. One should not conclude, nor is there any intent to imply, that all 
participants of the workshop or the wrap-up panel discussion endorsed these 
recommendations. 


MAJOR RECOMMENDATION FOR AREAS OF FUTURE RESEARCH 

1. An integrated approach to modeling, control law design, and optimization of space 
structures 

Rationale : Future space structures will be very large and complex with stringent 
performance requirements. Thus, the control requirements for slewing and stabilizing 
the structure, and for providing shape control and vibration suppression, necessitate 
careful review. To minimize control problems, these control requirements must be an 
integral part of the structure design process. Since a failure-proof control strategy 
satisfy^ these requirements does not exist, it is important that additional research be 
focused on structural modeling and design problems from a controls viewpoint. 

2. Model reduction and robustness 

Rational : Because of the Urge size and light weight, the space structure will exhibit a 
very Urge number of flexible mode shapes with low frequencies that will interact with 
the controllers. To control such structures, it is imperative to model the structure very 
carefully. To keep the model reasonably tractable from a computational point of view, 
a reduced order model has to be obtained. Some new results are now avaiUble to give 
guidelines in obtaining reduced order models, but good definitions and techniques are 
needed to measure robustness. 

3. Interdisciplinary approach in the control of flexible structures 

Rationale : The control of a Urge flexible space structure is a complex and challenging 
problem. In order to keep the control simple, functional, and tractable, all the 
available adjustable parameters must be exploited. For example, by changing certain 
material properties, the amount of power required for control can be decreased. Thus, 
by considering options offered through the material and structural properties involved, 
the control problem can be allevUted considerably. Researchers and engineers are 
recognizing the potentUl of this interdisciplinary approach. 

4. Adaptive control, estimation and identification 

Rationale : Some Urge space structures, such as space station, will undergo changes in 
their be dynamic characteristics. Adaptive control is potentUlly suitable under such 
conditions, but there U a need for extens 4 ve research. In addition, for adaptive control 
to be effective, a good estimation and identification scheme is needed. 
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5. Control of distributed systems 


Rationale : Since the proposed space structure will be large, it can s e viewed as a 
distributed parameter system. Results obtained in a distributed parameter setting can 
give a good understanding and insight into choosing a proper control strategy. Control 
of distributed systems can be viewed as a limiting case of lumped parameter system 
control. Thus, one can exploit the strong theoretical results of distributed parameter 
systems theory to study the limiting behavior of tumped parameter systems. 

6. Ground and flight experiments 

Rationa le: There were a considerable number of laboratory experimental results 

presented at the workshop. These experiments are necessary to validate theory. There 
should be one or more benchmark test articles set up where various 
control/estimation/identification algorithms can be tested and compared to provide a 
better understanding of the overall control problem. Carefully selected and well 
designed on-orbit experiments should be conducted to further validate identification 
and control techniques. Particular emphasis should be placed on validating ground 
testing techniques and techniques for extrapolating ground test data to the space 
environment. The program should be well coordinated going from snalysis, to simple 
experiments, and to full-blown realistic experiments. 

7. Sensor/ Aetna tor placement and development 

Rationale : A proper sensor/actuator set on a flexible space structure can be highly 
effective in controlling the structure. Large and unique space structures will probably 
need different types of sensors and actuators. 

8. Control system fault detection and tolerance research 

Rationale : It is important to detect the faults and failures of sensors and actuators in 
time to control flexible space structures properly. This is particularly critical for 
stringent performance criteria. After fault detection, the control must be able to 
reconfigure to the new and smaller set of sensors and actuators. 

9. Nor linear control 

Rationale : Because of tho complex nature of the flexible structure, nonlinearity may 
be a part of designing a good control law. More attention needs to be paid to this area 
to develop a more mature and practical theory. 


OTHER R E C OMMBND ATIONS 

1. Real-time processor control laws 

High speed of computation (10 to 100 million operations per second) 
High reliability and durability 

Software languages, verification, validation, and fault tolerance 

2. Nonlinear state estimation 
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3. Active disturbance isolation for Space Station 

4. Analysis of high order structural systems with slightly nonlinear stiffness properties 

5. Theory of asymptotic properties of systems 

6. Applications of random field theory to estimation and system identification 

7. Combined treatment of identification and control problem accounting for modeling 
errors 

8. A cooperative program among NASA, DoD, universities and industry to make the 
most effective use of limited data from major ground and flight experiments 
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APPENDIX A 


PROGRAM SCHEDULE * 


WORKSHOP ON 

IDENTIFICATION AND CONTROL OF 
FLEXIBLE SPACE STRUCTURES 


June 4-6, 1984 

Hyatt IslandU Hotel 
San Diego, California 


National Aeronautics and 
Space Administration 


let Propulsion Laboratory 
4800 Oak Grove DrWe 
Pasadena, California 91109 


Langley Research Center 
Hampton, Virginia 23665 


G. Rodrigues, Technical Program Chairman 


*■ EDITOR'S NOTE: 


This is the final agenda for the workshop including the order in which sessions were 
held and papers presented. For convenience, the order of some of the papers and 
sessions was changed before compilation of the proceedings. 
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WORKSHOP ON IDENTIFICATION 
AND CONTROL OF FLEXIBLE 
SPACE STRUCTURES 


)une 4-6, 1984 


Hyatt Islandia Hotel 
San Diego, California 


OBJECTIVES 

The main objective of the Workshop is to explore the application of state-of-the-art modeling, estimation, identification and 
control methodologies to the control of flexible space structures. The Workshop responds to the rapidly growing interest within 
NASA in developing the new control technology required for large flexible space systems (platforms, station*, antennas, flight 
experiments) currently under design. These systems, much larger than any spacecraft flown to date, must satisfy very stringent 
performance requirements. The Workshop will provide a forum where leading researchers can share ideas, procedures and results 
on theory and methodology, as well as on practical experience with the emerging flexible space structures. 

The Workshop will consist of surveys, tutorial and contributed papers, and open discussion sessions in the following areas: 

MISSIONS OF CURRENT INTEREST - Space platforms, antennas, flight experiments, space station. 

CONTROL/STRUCTURfi INTERACTIONS - Dynamics modeling, distributed system theory, integrated design and 
optimisation, maneuver designs, attitude control and stabilisation, shape control. 

UNCERTAINTY MANAGEMENT - Parameter identification, model error estimation/compensation, adaptive control, robust 
control, fault detection, modular control, growth accommodation. 

EXPERIMENTAL EVALUATION - Ground experiment demonstrations, flight experiment designs. 

Sponsors 

National Aeronavtics and Space Administration; Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA; 
Langley Research Center, Hampton, VA. 

Steering Committee 

A. K. Bryson, Jr. - Stanford; J. F. Garibotti - H. R. Textron; V. O. Hoehne - APWAL; S. M. Joshi, L. W. Taylor, Jr. - 
NASA-LRC; D. Mclver - NASA Headquarters; D. L. Mingori, J. S. Gibson - UCLA; G. Rodrigues, Z. Z. Sslrmay, A. F. Tolivar 
- JPL; W. E. Vender Velde - MIT. 
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MONDAY MORNING 


8:50-9:45 (I) 
Chairmen: 


June 4, 1984 
P LBNARY SBSSION 1 


Regency A 


G. Rodriguez. Jet Propulsion Laboratory 

L. W. Taylor, Jr., NASA Langley Research Center 

NASA Space Controls Research and Technology Program 
R. W. Key and D. Mclver 

National Aeronautics and Space Administration 

AFWAL Control Technology Program 
V. O. Hoehne 

Air Porce Wright Aeronautical Laboratories 

NASA LSS Missions of Current Interest 
A. P. Tolivar and J. B. Dahlgren 
Jet Propulsion Laboratory 


CODING: MA » Monday morning 
MP * Monday afternoon 
TA * Tuesday morning 
TP » Tuesday afternoon 
WA > Wednesday morning 
WP * Wednesday afternoon 
(I) * Invited Paper 



MONDAY MORNING 
June 4, 1984 


SESSION MAI; K(|ncyA 

CONTROL OP SPACE STATIONS 

Chairman: I B Dahlgren 

let Propulsion Laboratory 

1000 10:30 

A Multivariable Control Trrbatyi for a Coatrol Coafifand 
Space Station 

I. W. Sunkel, NASA lohnson Space Center 
A. P Hots, McDonnell Douglas Company 

10:30-11:00 

A Dual Spin Space Station Design 

M. A. Pslussek, C S Draper Laboratory 

£ 11:00 11:30 

^ Dynamics and Control of a Deal Spte Space Station 

I *■ Velman and f. J McRtman, Hughes Aircraft 
Company 

1130 13:00 

Aotonatic Assembly of Space Stations 

P.K.C. Wane. University of California, Los Angeles 

13:00 12:30 

Space Station Dynamic ModoHng, Diaturbonca 
Accomm o dation, and Adaptive Control 

S I Want. - C. lb. T H. Lin end B. MelUer 
let Propulsion Laboratory 


SESSION MA2 Recency H 

FLEXIBLE MliLTfBODY DYNAMICS AND CONTROL 

('hsmnsr '1 M | until 

NASA 1 sngley kenvsrrh Onier 

10 00 in .Ml 

A Redactat Transformation for Dynamics Modeling of a 

Cluster of Contiguous Flexible Structures with Constraints 
R P Singh. Honeywell, Inc 
P W l.iktns. Lehigh University 

10 30 11:00 

Vibra tion/Llbra tion Interaction Dynamics and Control 

During tbe Orbuer Eased Deployment of Flexible Members 
V | Modi end A M Ibrahim, University ut British 
Columbia 

11:00 11:30 

Design of Multivariable Controllers Using the Integrated 

Analysis Capability (IACI 

f A Boss i, University ot Washington 
S A. Wtnkleblsr k, G A. Price. Boeing Aerospace Co. 

11:30 12:00 

Simulation cf Spacecraft C rntrol/Stmcture Interaction 

Uetap Structural System Dynamic Analysis Software 
I W. Young. SDRC. Inc 

12:00 12:30 

Attitude Control TredvoT 9 v Be ween e Flexible Beam 

end a Tether Configurt j» 

S. H. Graff, let Propuisir^ Laboratory 


ntSION MAE: t agency C 

MODELING AND MODEL REDUCTION 

Chairman: f. Sesak 

General Dynamics Co. 

10:00 10:30 

Aa Overview of Modeling and Control of Largo PlasfMe 
Space S tree tares 

I E. Santiago. W. I Lange. Jr, and M. lamahidl 
University of New Mexico 

10:30 11:00 

Pregnancy Domain Control of Largo Spa^a 

Structures i A P recti cal Approach 

A. Das. R. Harding. General Electric Space Center 

11:00-11:30 

Model Redaction Baaed on Pregnancy R espouse for Control 
System Design 

D. Ism, Honeywell Systems and Research Center 
11:30-12:00 

A Control Concept for Urge Flexible Spacecraft Utteg 
Order Reduction Torhnjgnss 

G. Thieme and K. Roth, Do miar- System GMBH, West 
Germany 

12:00-12:30 

Wideband Disturbance Accommodation U fTrrrttlnu FlttMt 
Space Structures 

D. R. Hegg end G f. Kissel, C.S. Draper Laboratory 



SESSION MPI Regency* 

CONTROL OP LARGE T1NNAS 

C'hetrman V \\ rtochnc 

Ah Loire Wright Acumauucat Ubontones 

1:45 1 M\ 

Dynamic Performance of Several Large Antenna ConceyU 
('. C Andersen ami I. H Garrrli . NASA Langley Research 
Center 

R E Calleson, Rent ion International 
J 30 3:00 

Antenna Pointing of Large Flexible T* unmini rnirw 
Spacecraft 

B Covin and A Bousquel, MATRA Space Branch. Prance 
3:00 3:30 

Design and BnJmtha of Control Systems for Large 
C omm a * rations Satellites 

M E. Stieber. Communications Research Centre. Canada 

3 30 4 00 

Control of Large Antennas Baaed on Electromagnetic 
Performance Criteria 

Y H Lin. M Hamtdi, F Manshadi. let Propulsion 
Laboratory 

4:00 4:30 

Vibration Control Experiment Design for the I1M 
Hoop/Cofnmn Antenna 

P M Ham and D C Hyland, Harris Corporation 

4 30 5:00 

A Hardware Demonstration of Distributed Control for a 
Flexible Offset- Peed Antenna 

0. B. Schaechter and N C. Nguyen, Lockheed Missiles and 
Space Company 

5:00-3:30 

Model Development for Spacecraft Pointing and Shape 
Control 

J . Sesak, General Dynamics Corporation 
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SESSION MP2. Regency B 

CONTROL OP DISTRIBUTED PARAMETER SYSTEMS 

Chairman PRC Wang 

bniv* rsiiy of California. Los Angeles 

I 43 2 30 (h 

The M1PAC Facility: investigation of Properties of 
Distributed Parameter Systems 

D. L. Russell, University of Wisconsin 

1 30 3:00 

Finite Control in Unde (damped Distributed Parameter 
Systems 

l). I . Inman, State University of New York, Buffalo 
3:00 3:30 If) 

New Directions M Asymptotically Stable Finite -Dimensional 
Adaptive Central of Linear Distributed Parameter Systems 
M. Belas, Rensselaer Polytechnic Institute 

3 30 4:00 

Factorisation Approach to Optimal Control Design 
M H. Milvnan, let Propulsion Laboratory 

4:00 4:30(1) 

Estimation of Parameters In Distributed Models for Large 
Space Structures 

H. r. Banks, brown University 

I. G. Rosen, C.5. Draper Laboratory 

4.30 5:00 

Space Station Parametric Models 

M H » midi and S. |. Wang, let Propulsion Laboratory 


SBSSION MP3: Regency C 

MODEL REDUCTION AND ROBUSTNESS 

Chairman: A. F. ToUvar 

let Propulsion Laboratory 

1:45-3:30 

Study on the Control of 3rd Generation Spacecraft 

B. J. Davison and W. Gating, University of Toronto, Canada 

2:30-3:00 

I anas r/ Ac tee toe Selection for the Cenetrsined Variance 
Control Pr ob l em 

M. L. DeLorenso, R. S Skelton, Perdu University 
3.00-3 30 

Itgaarrafcaa Assignment by Constrained Optlndsation 

S. M. DeCaro, D. I. Inman, Sute University of New York, 
Buffalo 

3:30 4:00 

Matrix Transfer F u nc t ie u Central Dasign for flnbli 
Structures An A pp li cation 

T. (. Brennan, A. V. Compito, A. L Doran, C. L. 

Gesuftoe. C. L. Wong, Aerospace Corporation 

4:00-4:30 

R o bu st Control Design for Largo Spa ce fi ncum 

W. L. Eastman, Me Donnell -Douglas, Houston 
f. A. Boasi , University of Washington 

4:30-5.00 

Influence of Structural Parameter Uncertainties an Flexible 
Space Structure Central Performances 
L Passeron, Aerospatiale, France 


3:00 5:30 

Ou the Stability of C alloc a tad CoutmUen M the Pi 
of Uncertain NonlineariUee and Other Perils 
5. M. lot hi, NASA Langley Research Center 


MONDAY EVENING 


7 ; 00-4:00 


DDfMBt 


RXGBNCY 



TUBSDAY MORNING 
June 5, 1984 


SESSION TA1 Regency A 


SESSION TA2: Regency B 


DTNAMCr KD CONTROL EXPERIMENTS 

Chairman R A RmmI! 

NASA He-Jquanc.s 


• IS 9:00(0 

Ltrt« Smcc Su9ctm DyiaNct and Conti A MinHimsu 
| Pfinon, Air Force Wright Aeronautical Laboratories 

9 00 9:30 0) 

Control T WtriMt Sincum Crowd lyrtMt Plana 
• - A RestHi, NASA Headquarters 

9:30 fOtOO 

Space Station Configuration rad Plight Dynamics ID 
E Me tiler and M Milmaa, let Propulsion Laboratory 

£ 10:00 10:30 

5o Cur-ant Status of NA^a/BEB Control Design Cfcattenge 
L. W. Taylor, Ir., NASA Langley Research Center 

10:30 11:00 

Large Space Structure Flight Experiment 
S. |. Wang, let Propulsion Laboratory 
D. C. Scfcwao, Lockheed Missiles and Space Company 


11:00 1130 

Time -Optimal Ba^Bang Slew of EftgNksed SC OLE 
Configuration 

|. C. Lin, Northeastern University ani Control Research 
Corporation 

L. W. Taylor, fr., NASA Langley Research Center 
11 30-13:00 

NAS A/MS PC Ground Experiment for Large Space S true turn 
Control Verification 

S. M. Seltaer, Control Dynamics Cosupany 
H B. Wsites, NASA MarshaU Space F^ght Center 


INTXC RATED MODELING. DESIGN AND OPTIMIZATION 

Chairman: G. T. Tseng 

Ae r ospace Corporation 


R:IS 9:00 

Approximation of Optimal PbiU-Dlr-v Wil 
Compensators for Flexible Structures 

I. 5 Gibson, D. L. Mingorl. A. Adamiaa, and P. fabbari 
University of Calif on Los Anjales 


900 9:30 

Control of a Flexible Space Antenna: A Finite- Dimensional 
Solution Baaed an Distributed Parameter Theory 

D. L. Mingorl, |. S. Gi b e on, P. Blellock, and A. Adamian 
Driven ily of California, Los Angelas 


9:30-20:00 

An Integrated Control said RUmn Mesa Sunctural 
Opt Indas flan Algorithm for Large Space Structures 
A. M ass ac, |. D. Turner, and I. Sooaar. C. S. Draper 
Laboratory 


10:00- 10:30 
Ch rscteristic Rlaatic 


Sy st ems ef Tim- Limited Optimal 


A. L. Hale and R. I- LUoviski, Department or Aeronautical 
and Asironautical Baglneertag, University of Illinois, 
Urbans Champaign, Illinois 


10:30 11:00 
ISAAC (Integrated ! 
Co 


I Analysis and Control) via 


C. L. Gustafson, M. Aswany, A. L. Doran, and G. V Tseng 
Aerospace Corporation 


1 1:00 I l:S0 

Contiui/? tree tare Optimisation 

M. HamidJ, M. Seisms, and L. Demset?, >t Propulsion 
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11:30-12:90 

Panel Disruption: Panel will discuss the relationship 
be twee n the various Integration approaches currently wrier 


SESSION TA3: Regency C 

ADAPTIVE CONTROL 

Chairman: R. W. Montgomery 

NASA Langley Research Center 

0:13-9:00 (I) 

Adaptive Control - Actual Status and Research Trends 

T. D Landau, Labors toirc d'AulomaUque de Grenoble, 
Prance 
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A Nonlinear Dual -Adaption Control Strategy far 
MnnUfltation raf Control of Large S pac e St r u c ture s 
P. 1. Thau, City College of the City University of New 
York 

9:30-10:30 

Stable Dftroct Adaptive Control of Linear 

Infinite- Dimensional Systems Using a Command ^winter 

Tracker Approach 

M. I. BaUs and H Kaufman, Rensselaer Polytechnic 
Institute 

10:00 10:30 

Self -Taring Adaptive Controller Using -Mine Fr e q u e n cy 
Identifies Lien 

W. W. Ctuaag and R. H. Camon, Ir., Stanford University 
10:30 11:00 

Adopt i ve Control for Large Space S tr u e to rus - A 
Clueed-Porm Filtering Srtn Usn 

H. E. Ranch and D. I. Schaechter, Lockheed, Palo Alto 
Research Laboratory 

21:00- 1 1:30 (!) 

Robust Algorithms for Adaptive Control 
K. S. Narendra, Tale University 

11:30-12:30 

Panel Diacnstlon: Panel will discern the rule that adaptive 
control theory can play In the control of flexible space 
structures. 
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June 5, 1984 


SESSION TFI Rr.ency A 

CONTROL/STRUCTURE INTERACTION EXPERIMENTS 

('Human I A Hrealwr-ll 

Lorkherd. f«lo Alto Researrh Lab 

145 2 30 f|> 

PlttdMe Kokot for Space hppUcalioM 
R. Gran. Grumman Aeros^aft r ompany 

2 30 3 00 

KxpartneMa *a Stractval Dynanci and Control Using • 
Grtd 

t. C. Montgomery and N Sundararajan. NASA Langley 
Research Center 

3:00 3:30 

The Coa p iig/ Controlled Up trtaw uUot t# a 
Two-DHawiicaal Hyperbolic Syittfi 

i Yam, University o! California, Los A j^les 
MU .n and D H. Staetta, MIT 
T L. loNtaat, Boll Beravk and Nt n^a Inc. 

3:30 4 00 

latk for m iTpariMisfl Tad for FtoriM* Space 

Sira **Titadc Control 

Tb G Heimbold, B. Schafer. H. Holsach. DFVLR. 

FUG 

4:00 4:30 

Namrkal and lepsrtmontal Iwlaaik* for Single- Ams 
Control of on LSS Laboratory M od a l 

Y Ohkami, O Okamotx S. Yotha.imra. V. Bide, and I. 
^ameguelu, National Aerccpate Laboratory, Japan 

4:3a# 5:00 

Me. a janaa t of Mit^riai Da my mg In a SlamUtod Space 
F^mmaal 

w iberg. Su <u <t i ’ *r 'veraity 


SESSION TF2: Rageiicy W 

PARAMETRIC AND NONLINEAR CONTROL 

Chairman M. Ha midi 

let Propulsion Laboratory 

I 45 > 30 

Parame tric SUIimii Control of Flaxlble Stroc tores 
k t\ Moon and ft H Rand. Cornell University 

2:30 3:0** 

Vibrational Sta Dili* slice ** Plexie!' Uue tores 
M 7a k. Jet Propulsion La be ratio 

3:00 3 30 

Stiffness Control or Large Spore Structures 

J. L Fanson, I C Chen, lei Propulsion Laboratory 

3 30 4:00 

Sob- Optimal Control of Nonlinear Flexible Sp'.ce Structures 
I Dehaghanyar, S. P. Masn. R. K Miller, and G. A. 
be key, University ol Southern California 
T K Caughey, California Institute ol Technology 

4:00 4:30 

EKBAE 

4:30 S:S0 

Panel Discs ilon: IntordiaclpUnary laauts In tha Control of 
Floafhlo Stfucturoa 

Tentative Participants: f. F. GarihotU. Moderator; D. L 
Mtefori. UCLA; V. B Venkayya. APWAL; A. Nayak, H 1 
Textron; f Sesak, General Dynamics Co. 


3:00-5:30 

Optimal Ir ;i*a* m <•* P' nping MtOirt tiUoa liyirtnmu 
D B Eld red |<‘ ' rcvlsios Laboratory 


SBSSION TPS: Regency C 

ESTtMATTONADENTTPlC ATION 

Chairman: L. W. Taylor, Jr 

NASA Lang'ey Research CenUr 

1:45-2:30 <1> 

Ma tkamm Likel ih oo d Estimatioo with E m pba afs on Aircraft 
PUgbt Data 

E. W. lUff. NASA Ames Research Center 
Dryden Plight Research Facility 

2:30-3:00 

Optimal 5 ana nr Locationa for Structural identification 
P. t. Udvradia, University of Southern California 
; A. Garba, Jet Propulsion Laboratory 

S:00-S:30 

ComMnod State and Para mater EeUmaticn for a SutAc 
Modal of tha Maypole (HovfSCohmsa) Antenna Surface 
H. T. Banks. Brown UnirorsHv 
P. E. Lamm. Soutbero Methodist Univertiiy 
E. S. Armstrong, NASA Langley Research C«n*er 

3.30 4.00 

Eaparlmra f eJ Verification of f tentlfleatson Algorithms for 
Control o. Flexible Structure 

B. Srtdhrr, f. N. Auhrvn. and K R LoraU, Lockheed Palo 
ai*o Research Laboratory 

4:0O«:3O 

An Eigenrynem Realisation Algorithm (BBA1 for Modal 
mtar Mantlflcatlun 

N. Joans and R. S. Pappa, NASA Langley Research 
Center 

4:30 5:00 

A F u nc t ion Space Ap p r oa c h to Mido a n UkoM h n o d 
Ideati/icatAon of Paraamtara in Elliptic System* 

G. Rodrigues rod R. Scbcad, Jet Propulsion Lebotaiu 


TUESDAY EVENING 


4:30 t:30 


NO HOST SOCIAL PERIOD 
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WEDNESDAY MORNING 
June 6, 1984 

♦PLBNARY SESSION 1 -AMERICAN CONTROL CONFERENCE 

8:30-9:30 Regency Room 

CHAIRMAN: Herbert B. Rauch, La sheet*, Palo Alto Research Lab 

CO-CHAl.>\MAN: Leonard Shaw, Polytrc*.mic Institute of New York 

KEYNOTE ADDRESS: Getting It All ri ntfer Control 

RoLert A. F roach, Vice President, 

General Motors Research Laboratory s 


*This plenary session constitutes the initiation of the 1984 American Control Conference. While the session 
is not part of the Identification and Control Workshop, all workshop participants are welcome to attend at 
no additional cost. Because it was anticipated that this plenary lecture by a former NASA administrator 
would generate significant interest among the workshop participants, no other workshop activities hare been 
scheduled in this time period. A workshop plenary session described on the Fallowing page has been 
scheduled immediately following the ACC plenary session lecture. 



WEDNESDAY MORNING 
June 6, 1984 
PLENARY SESSION 2 

9:45-1:00 0) *•*“'» A 

TITLE; FUTURE RESEARCH DIRECTIONS 

CHAIRMAN: H. A. Redies*, H. R. Textron 


Uncertainty Management Methodology for Large Space Structures 
W. B. Vender Velde, MIT 

Some Estimation and Identification Problems in Random Fields 
A. V. Balakrishnsn, University of Cslifornia, Los Angeles 

Some Trends and Problems in Control Theory of Distributed Parameter Systems 
J. L. Lions, I.N R.I.A. and CoUege de Prance 

Igffd el Error Structure and the Inseparability of the Control and Identification Problems 
R. R. Skelton, Purdue University 

The Role of Experiments in the Development of Control Technology 
H. A. Rediess, H. R. Textron 

Fut ur e Research sad Technology Directions for Space Station 
R. P. Carlisle, NASA Headquarters 

Panel Discussion: Panel will discuss the state -of- the- art in modeling, estimation, identification and 
adaptive control and will identify potential research opportunities for NASA consideration. This 
wrap- up te trl iu * will also provide a forum for all participants to further comment on the papers and 
issues presented and to contribute recommendations. 
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